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(54) METHOD AND APPARATUS FOR RETRIEVING, ACCUMULATING, AND SORTING 
TABLE-FORMATTED DATA 



(57) This invention provides a method and appara- 
tus for searching for and tabulating table-format data 
that not only has the functions of a conventional data 
table but also greatly increases the speed of searching 
for and tabulating large amounts of data. The method 
for searching for and tabulating table-format data repre- 
sented as an array of records including fields containing 
field values for each field according to the present in- 
vention comprises: keeping in a storage device, a value 



control table containing field values in the order of field 
value numbers corresponding to field values belonging 
to a specific field, and a field value number- specifying 
information array containing information that specifies 
the field value numbers in the order of records, acquiring 
from the field value number- specifying information ar- 
ray the field value number corresponding to the specific 
record, and obtaining from the field values stored in the 
value control table the field value corresponding to the 
field value number thus acquired. 



FJ6, 5 

ARRAY OF POINTERS TO VALUE VALUE CONTROL TABLE ARRAY OF POINTERS TO RECORDS 

CONTROL TABLE 

CATEGORY START 




LU 



Printed by Jouve, 75001 PARIS (FR) 



1 



EP1 136 918 A1 



2 



Description 

BACKGROUND OF THE INVENTION 
Field of the Invention: 

[0001] The invention relates to a data processing 
method and data processing apparatus for processing 
large amounts of data using a computer or other infor- 
mation processing apparatus, and particularly to a 
method and apparatus for searching for, tabulating and 
sorting table-format data. 

Description of the Prior Art: 

[0002] Conventionally, large amounts of data are ac- 
cumulated and searching and tabulating and other types 
of data processing is performed on the accumulated da- 
ta. This data processing may be done using, for exam- 
ple, a known computer system including a CPU, mem- 
ory, peripheral interface, a hard disk or other auxiliary 
storage device, a display, a printer or other output de- 
vice, a keyboard, a mouse or other input device, and a 
power supply unit connected via a bus, and particularly 
as software that can be run on a readily available com- 
mercial computer system. In order to perform the afore- 
mentioned searching, tabulating or other types of data 
processing, various types of databases that particularly 
store large amounts of data are known. Among various 
types of large amounts of data, there is a particularly 
strong demand to process data that can be expressed 
in a table format. Fig. 1 is a diagram showing an example 
of expressing the data to be processed in a table format. 
Fig. 1 shows an example wherein the sex, age and oc- 
cupation data for a large number of people, e.g. 1 mil- 
lion, are stored in a table. In Fig. 1 , the horizontal rows 
in the table, namely the so-called records, consist of the 
record number, and the sex, age and occupation fields 
corresponding to the record number. The vertical col- 
umns in the table consist of the record number, sex field, 
age field and occupation field. The table indicates that 
the person with the record number of "0" has a sex of 
female, age of 1 8 and occupation of programmer. In the 
following explanation, the data such as "Female," "18" 
and "Programmer" set in the various fields are called 
field values. In addition, in the following explanation, un- 
less otherwise indicated, the table-format data consist- 
ing of 1 million records shown in Fig. 1 is used as a spe- 
cific example of a large amount of data. 
[0003] Whether or not large amounts of data can be 
searched for or tabulated efficiently depends on the for- 
mat in which the large amount of data is stored. Con- 
ventionally, typical known storage techniques include 
the so-called "record-sequential" and "field-sequential" 
storage techniques shown in Figs. 2A and 2B, respec- 
tively. 

[0004] Fig. 2A and Fig. 28 show a representation of 
the data storage format on a storage device, e.g. a hard 



disk. In the case of the record-sequential storage tech- 
nique in Fig. 2A, a set of the field values of sex, age and 
occupation for each record number is stored on disk in 
the order of increasing logical addresses sequentially 

5 for each record number. On the other hand, in the case 
of the field-sequential storage technique in Fig. 28, for 
each field, the field values are stored in record number 
order grouped by field in the direction of increasing log- 
ical addresses. To wit, in the example of Fig. 28, the field 

10 values for the sex field corresponding to record numbers 
"0" through "999999" are arranged in order, and next, 
the field values for the age field are arranged in record 
number order, and then the field values for the occupa- 
tion field are arranged in record number order. 

15 [0005] Inthecaseof the aforementioned prior art, field 
values corresponding to all fields for all record numbers 
are stored as is in a two-dimensional data structure (with 
the record number as one dimension and the other field 
values as one dimension). Hereinafter, such a data 

20 structure in particular shall be referred to as a "data ta- 
ble." In the case of the prior art, when searching for and 
tabulating stored data, this is performed by accessing 
such a data table. 

[0006] In addition to the method of storing the value 
25 of the fields as field values as is, there is also a known 
method of converting the values to codes and storing 
the codes as field values. For example, with respect to 
the sex field, the value "Male" may be converted to "0" 
while the value "Female" is converted to "1" and then 
30 the values "0" or "1" are stored as the field values in- 
stead of "Male" or "Female." Even in this case, there is 
no change to the point that the converted codes are 
stored in a data table as field values. 
[0007] In the case of searching for and tabulating 
35 large amounts of data stored using a data structure of 
the data table type in the aforementioned prior art, there 
is a problem in that the processing time for searching 
and tabulating becomes longer due to the access time 
required to access such data tables. 
40 [0008] In addition, data tables have at least the follow- 
ing intrinsic drawbacks. 

(1 ) The data tables easily become enormous in size 
and cannot be easily separated (physically) into in- 

45 dividual fields. For example, when extracting 
records in which the sex is "Male," the age and oc- 
cupation information is unnecessary, so efficiency 
could be improved if the table could be separated 
into a table containing only the sex fields. In the 
50 case of the field-sequential storage technique 
shown in Fig. 28, while separation into individual 
fields is simple, when large amounts of data are 
handled, the size of the data table still becomes 
enormous, so the actual expansion of a data table 
55 into memory or other fast storage device for the pur- 
pose of tabulating or searching is difficult. 

(2) Data tables cannot be kept in aform with multiple 
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field values sorted simultaneously. For example, in 
the case of the prior art illustrated in Fig. 2A and 
Fig. 2B, the field values for the sex field are ar- 
ranged in record number order in the manner "Fe- 
male, Male, Female, Female. " However, when 
performing searching and tabulating processes, it 
is typically convenient for them to arrange in the 

manner "Female, Female, Female, Male Male. 

" However, in table data, the field values are ar- 
ranged in a specific matrix order, namely record 
number order, so sorting the field values on a spe- 
cific field is not permitted. For this reason, in the 
case of the prior art, it is not possible to select an 
arrangement of field values that is convenient for 
searching and tabulating. 

(3) In a data table, identical values appear over and 
over. For example, in the case of the conventional 
data table given in Fig. 2A and Fig. 2B, at the time 
of extracting records wherein the sex is "'Male' or 
'Man"' (or namely, record numbers), because the 
field value "Male" appears many times, it is neces- 
sary to perform the matching operation "'Male' or 
'Man'" which is the comparison condition with the 
field value of "Male" many times. A single compar- 
ison should be sufficient to make the determination 
of whether there is a match with identical values. 

[0009] In order to increase greatly the speed of 
searching for and tabulating large amounts of data, the 
object of the present invention is to provide a method of 
searching for, tabulating and sorting table-format data 
and an apparatus for implementing said method by pro- 
viding a data control mechanism that both has the func- 
tions of the conventional data table and solves the afore- 
mentioned problems with the data structure based on 
the data table. 

SUMMARY OF THE INVENTION 

[0010] In order to achieve the aforementioned object, 
the method and apparatus for searching for and tabu- 
lating table-format data according to the present inven- 
tion proposes a novel data control mechanism that is 
usable on an ordinary computer system. The data con- 
trol mechanism according to the present invention com- 
prises a value control table and an array of pointers to 
the value control table, as a general rule. 
[0011] Fig. 3 isadiagram used to explain the principle 
of the present invention, showing a value control table 
1 0 and an array of pointers to the value control table 20. 
A value control table 10 is defined to be a table made 
by assigning, for each field in table-format data, an (in- 
tegral) field value number to each field value be longing 
to that field, and the table thus contains the field values 
corresponding to said field value number arranged in or- 
der of the field value numbers ( reference number 11) 
along with a category number (reference number 12) 



which relates to said field value. An array of pointers to 
the value control table 20 is defined to be an array con- 
taining pointers to the field value numbers of the col- 
umns (namely, the fields) in the table-format data, 

5 namely to the value control table 10, arranged in order 
of the record numbers of the table-format data. 
[0012] By combining the array of pointers to the value 
control table 20 with the value control table 10, given a 
certain record number, it is possible to use the array of 

10 pointers to the value control table 20 pertaining to the 
field in question to extract the stored field value number 
corresponding to that record number, and next, extract 
the stored field value corresponding to that field value 
number within the value control table 1 0, and thus obtain 

15 the field value from the record number. Therefore, in the 
same manner as with a conventional data table, it is pos- 
sible to access all data (field values) with coordinates 
consisting of the record number (row) and field (col- 
umn). 

20 [0013] The data control mechanism according to the 
present invention, which includes a value control table 
generated for a certain field within the fields of table- 
format data and an array of pointers to the value control 
table , may also be referred to in particular as an "infor- 
ms mation block" in the following explanation. 

[0014] While conventional data tables offer the inte- 
grated control of all data using the coordinates of the 
rows corresponding to records and columns corre- 
sponding to fields, the information blocks according to 
30 the present invention are characterized in that the data 
are completely separated by column in the table format, 
namely by field. In this manner, by means of the present 
invention, large amounts of data are separated by field, 
so it is possible to load only that data related to those 
35 fields required for searching or tabulating into memory 
or other high-speed storage device, and as a result, the 
access time to the data is reduced, so the searching and 
tabulating processes are speeded up, and even ex- 
tremely large amounts of data can be handled without 
40 adversely affecting performance. 

[0015] In addition, in the case of the information 
blocks according to the present invention, the field val- 
ues are stored in the value control table, and the record 
numbers that indicate the position of the value are as- 
45 sociated with the array of pointers to the value control 
table, so there is no need for the field values to be ar- 
ranged in record number order. Therefore, data can be 
sorted on field values such that it is suited to searching 
and tabulating. Thereby, the determination of whether 
50 or not a field value matching the target value is present 
in the data can be performed at high speed. Further- 
more, corresponding field value numbers are assigned 
to the field values, so even if the field values consist of 
long data or text strings, they can he handled as inte- 
rs gers. 

[0016] Moreover, by means of the present invention, 
all of the field value numbers of the value control table 
10 correspond to different field values, so the number 



25 



30 



35 



40 



45 



50 



3 



5 



EP1 136 918 A1 



6 



of comparison operations between a specific value and 
the field values which arc required to extract a record 
containing a field value having that specific value is no 
more than the number of possible field values, namely 
the number of field value numbers, so the number of 
comparison operations is greatly decreased, thus 
speeding up searching and tabulating. At this time, while 
a location is required to store the results of determining 
whether or not a certain field value matches, for exam- 
ple, the category number 1 2 can be used as this storage 
location. 

[0017] Fig. 4 shows the information blocl^ according 

to the present invention which comprises a value control 
table 1 0 including an array of field values 1 1 containing 
the field values, an array of category numbers 12 con- 
taining the category numbers, and an array of counts 1 4 
containing the counts. The array of counts 14 contains 
numbers which indicate a count of the number of times 
each field value is present within all data in a certain 
field, or in other words, the number of records which 
have a stipulated field value. By preparing such an array 
of counts 14 within the value control table 10, the infor- 
mation "(how many instances of) which data is present? 
" required at the time of searching or tabulating can be 
obtained immediately, thus speeding up searching and 
tabulating. 

[0018] Fig. 5 shows an information block including a 
value control table 1 0, array of pointers to the value con- 
trol table 20 and an array of pointers to records 30. The 
array of pointers to records 30 is defined to be an array 
containing, for each field value number, namely each 
field value, pointers to records that have that field value 
( corresponding to the record number). The number of 
pointers contained in the array of pointers to records 30 
for each field value matches the number of entries in the 
array of counts 1 4 in the value control table 1 0. In addi- 
tion, an array of start positions 13 which specifies the 
starting address of a group of pointers for each field val- 
ue may be provided within the array of pointers to 
records 30 . By providing such an array of pointers to 
records 30 in the information block in this manner, a set 
of records that have a particular field value for a certain 
field can be extracted quickly. The count ( reference 
number 14) and start positions (reference number 13) 
of pointers stored in the array of pointers to records 30 
are set in the value control table 1 0, so the fact that the 
values and count are present in the information block as 
is such that they are usable at the time of tabulating is 
an advantage. 

[0019] Here follows an explanation of the method of 
searching for and tabulating table-format data according 

to the present invention. Note that in the following ex- 
planation, the individual field information refers to the 
aforementioned "information block," and the field value 
number-specifying information array refers to the afore- 
mentioned "array of pointers to the value control table" 
while the record identifying information array refers to 
the aforementioned "array of pointers to records. " 



[0020] When table-formal data is represented as an 
array of records including a plurality of fields containing 
field values for each field, the method of extracting from 
the table-format data the field value corresponding to a 
5 specific field and a specific record according to the 
present invention comprises the steps of: 

keeping in a storage device, for each individual 
field, a value control table containing field values 
10 which are located in order of field value number 
each corresponding to the field value belonging to 
the specific field, and a field value number-specify- 
ing information array containing information that 
specifies the field value numbers in the order of the 
15 records, 

acquiring from the field value number- specifying in- 
formation array the field value number correspond- 
ing to the specific record, and 

20 

obtaining from the field values stored in the value 
control table the field value corresponding to the 
field value number acquired as above. 

25 [0021] In addition, with the method of obtaining field 

values according to the present invention, in order to 
categorize the field values corresponding to the field val- 
ue number, category numbers are stored in the value 
control table corresponding to the field value number, 
30 and the category numbers are accessed at the time of 
obtaining the field value corresponding to the field value 
number. 

[0022] When table-format data is represented as an 
array of records including field values with respect to a 
35 field associated with a search condition, a single-search 
method of searching through said table-format data for 
field values that match a specific search condition com- 
prises the steps of: 

40 keeping in a storage device, for each individual 
field, individual field information such that includes 
a value control table containing field values which 
is located in order of field value numbers each cor- 
responding to the field value belonging to the field 
45 associated with the search condition, a field value 
number- specifying information array containing in- 
formation that specifies said field value numbers in 
the order of said records, and a record identification 
information array storing in exclusive areas for each 
50 of said field value numbers one or more pieces of 
record identification information related to identical 
field value numbers, and said value control table in- 
cludes, for each of said field value numbers, record 
identification information-specifying information 
55 that indicates the area where said one or more piec- 
es of record identification information related to 
identical field value numbers in said record identifi- 
cation information array, 
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using said record identification information-specify- 
ing information corresponding to field value num- 
bers related to field values within the field values 
contained in said value control table that match said 
search conditions, to acquire record identification 
information from said record identification informa- 
tion array that matches said search conditions. 

[0023] In addition, the multiple-field search method 
according to the present invention comprises the steps 
of: 

l^eeping in a storage device the result set of records 
that match the search conditions obtained by the 
single-field search method according to the present 
invention, 

selecting separate individual field information re- 
garding fields related to separate search conditions, 

acquiring from the field value number- specifying in- 
formation array regarding the separate individual 
field information the field value number correspond- 
ing the pieces of record identification information 
that match the search conditions set in the result 
set, 

regarding the separate individual field information, 
determining whether or not the field values identi- 
fied by the extracted field value numbers match the 
separate search conditions, and 

regarding the separate individual field information, 
if the field values identified by the extracted field val- 
ue numbers match the separate search conditions, 
extracting the pieces of record identification infor- 
mation corresponding to the field value numbers as 
pieces of record identification information that 
match the separate search conditions, and 

[0024] Alternately, as a variation to the multiple-field 
search method, it is possible to implement a so-called 
OR search. In more detail, this method comprises the 
steps of: 

keeping in a storage device the result set of records 
that match the search conditions, 

regarding other individual field informal ion related 
to other search conditions, 

using field values within the field values stored in 
other value control tables that match the search 
conditions and record identification information- 
specifying information corresponding to related 
field values to extract from a record identification in- 
formation array the records that match the other 
search conditions, and store the records that match 



the search conditions in a specified other recordset, 

if necessary, regarding still other search conditions, 
using still other record identification information- 

5 specifying information to extract records that match 

still other search conditions, and repealing the stor- 
age of still other result sets, and 

obtaining a final result set by eliminating duplicate 
10 records from the result sets thus obtained. 

[0025] When table-format data is represented as an 
array of records including a plurality of fields containing 
field values for each field, the method of tabulating the 
15 table-format data by each field value according to the 
present invention comprises the steps of: 

if n represents an integer equal to 1 or greater, for 
each of n fields used in tabulation, keeping in astor- 
20 age device individual field information including a 
value control table containing field values for that 
field corresponding to a field value number that 
uniquely identifies the field value, which is a field 
value number that is common to the various fields 
25 and has a stipulated order from an initial value, and 
a field value number-specifying information array 
containing information that specifies the field value 
numbers in the order of the records, 

30 if i represents an integer in the range 1 <i<n, for the 
i^*^ individual information field, the total number of 
the field value numbers is represented by Nj, kj rep- 
resents an integer in the range 0 <kj<Nj-1 , M repre- 
sents an integer equal to 1 or greater, and if m is an 
35 integer in the range 1 <m<IVI, then initializing ele- 
ments Pm(^^1 ' ... kj, ... k^) of n-dimensional IVI da- 
ta spaces having a size of N i xN2X...xNjX...xNn, 

for the n individual information fields, when j repre- 
40 sents an integer in the range 0<j<(total number of 
records)-1, extracting the respective field value 
numbers stored in the j^^ position in each field value 
number- specifying information array, and when the 
field value number extracted from the i^*^ individual 
45 information field is represented by q;, identifying the 
elements Pm(qi, qi- of the data space, 
and 

processing the identified values of the elements 
50 (qi,q2, ...,qi, ...,qn). 

[0026] When table-format data is represented as an 
array of records including a plurality of fields containing 
field values for each field, the method of tabulating the 
55 table-format data by the category of field values, 

the method being characterized in comprising the 
steps of: 
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if n represents an integer equal to 1 or greater, for 
each of n fields used in tabulation, keeping in a stor- 
age device individual field information including a 
value control table containing field values for that 
field and the category number of the field value cor- 
responding to a field value number that uniquely 
identifies the field value, which is a field value 
number that is common to the various fields and has 
a stipulated order from an initial value, and a field 
value number- specifying information array contain- 
ing information that specifies the field value num- 
bers in the order of the records, 

if i represents an integer in the range 1 <i<n, for the 
i^"^ individual information field, the total number of 
either the field value numbers or the category num- 
bers is represented by Nj, kj represents an integer 
in the range 0<kj<Ni-1, M represents an integer 
equal to 1 or greater, and if m is an integer in the 
range 1 <m<M, then initializing elements Pm(ki, 
k2, kj, kp) of n-dimensional M data spaces 
having a size of xN2X...xNjX...xNp, 

for the n individual information fields, when j repre- 
sents an integer in the range 0<j<(total number of 
records)-1, extracting the respective field value 
numbers stored in thef^ position in each field value 
number- specifying information array, and when the 
field value number extracted from the i^^ individual 
information field or the category number stored cor- 
responding to the field value number in the value 
control table of the i^*^ individual information field is 
represented by qj, identifying the elements Pm(qi, 
q2, qj, qn) of the data space, and 

processing the identified values of the elements 

(qi. qn)- 

[0027] With the method of tabulating counts in partic- 
ular according to the present invention, M=1 is true, and 
the step of processing the value of the identified element 
includes adding 1 to the current value of the element 

[0028] In addition, with the method of tabulating sta- 
tistical quantities according to the present invention, the 
step of processing the value of the identified element 
P^ comprises: for at least one element P^^ among the 
M elements P^, 



updating the current value of the element P^^ and 
the value of the element P^^ in combination with the 
field value thus obtained. 

5 [0029] With the present invention, the information that 
specifies the field value number may be the field value 
number itself. 

[0030] Alternately, in order to implement the so-called 
multi-answer fields wherein multiple field values are al- 

10 located to one field of a certain record, with the present 
invention, the information that specifies the field value 
number may be a binary value wherein 1 bit is allocated 
to each field value number, thus setting whether or not 
it is set. 

15 [0031] In addition, when table-format data is repre- 
sented as an array of records including a plurality of 
fields containing field values for each field, the appara- 
tus for searching for and tabulating the table-format data 
according to the present invention comprises: 

20 

a storage device for keeping, for each individual 
field, a value control table containing field values for 
that field corresponding to a field value number that 
uniquely identifies the field value, which is a field 
25 value number that is common to the various fields 
and has a stipulated order from an initial value, and 
a field value number-specifying information array 
containing information that specifies the field value 
numbers in the order of the records, 

30 

means of acquiring from the field value number- 
specifying information array kept on the storage de- 
vice the field value number corresponding to the 
specific record, and 

35 

means of obtaining from the field values stored in 
the value control table kept on the storage device 
the field value corresponding to the field value 
number acquired as above. 

40 

[0032] When table-format data is represented as an 
array of records including a plurality of fields containing 
field values for each field, the storage medium upon 
which is recorded a program for searching for and tab- 
45 ulating the table-format data according to the present 
invention is recorded with a program characterized in 
comprising: 

a step of keeping in a storage device, for each indi- 
vidual field, a value control table containing field val- 
ues for that field corresponding to a field value 
number that uniquely identifies the field value, 
which is a field value number that is common to the 
various fields and has a stipulated order from an in- 
itial value, and a field value number-specifying in- 
formation array containing information that speci- 
fies the field value numbers in the order of the 
records. 



for separate individual field information kept in a 50 
storage device, acquiring the field value numbers 
stored in the j position in the field value number- 
specifying information array, 

from among the field values stored in the value con- 55 
trol table of the separate individual field information, 
acquiring the field value corresponding to the field 
value number thus acquired, and 
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a step of acquiring from tine field value number- 
specifying information array kept on the storage de- 
vice the field value number corresponding to the 
specific record, and 

a step of obtaining from the field values stored in 
the value control table kept on the storage device 
the field value corresponding to the field value 
number acquired as above. 

[0033] The present invention also proposes a sorting 
method whereby an array of record identification infor- 
mation, e.g. record numbers, specifying records includ- 
ing a plurality of fields containing field values corre- 
sponding to fields of information is rearranged on a spe- 
cific field. With the sorting method according to the 
present invention, an array of pointers to the value con- 
trol table is formed wherein, for each record, record 
identification information is associated with field value 
number corresponding to the field values of a certain 
field. Next, for each of the field value numbers, the stor- 
age location after reordering said record identification 
information is defined. Said record identification infor- 
mation is sequentially extracted from the array, and said 
field value number corresponding to said record identi- 
fication information thus extracted is determined, the 
record identification information thus extracted is stored 
in said storage location according to the record identifi- 
cation information-specifying information correspond- 
ing to the field value number thus determined, and the 
storage location where the record identification informa- 
tion is to be stored is updated in order to store the next 
record identification information. 
[0034] A preferred embodiment of the sorting method 
according to the present invention comprises the steps 
of: keeping in a storage device individual field informa- 
tion including a value control table containing field val- 
ues in the order of field value numbers corresponding 
to field values for a field value associated with search 
conditions, and a field value number-specifying informa- 
tion array containing information that specifies field val- 
ue numbers in the order of the records, where the value 
control table further includes record identification infor- 
mation-specifying information that, for each field value 
number, indicates the area in said record identification 
information-specifying information array where said one 
or more pieces of record identification information re- 
garding identical field value numbers are stored, and is 
constituted such that, record identification information is 
stored at storage locations according to the record iden- 
tification information-specifying information. 
[0035] Moreover, the objects of the present invention 
may be achieved by an apparatus for implementing the 
aforementioned methods, a computer-readable storage 
medium containing a program according to this method, 
or a computer-loadable program product according to 
the method in question. 



BRIEF EXPLANATION OF THE DRAWINGS 

[0036] This and other objects of the present invention 
will be made clear in reference to the appended draw- 
5 ings and embodiments. Here, 

[0037] Fig. 1 is an explanatory diagram illustrating 
typical table-format data. 

[0038] Figs. 2A and 2B are explanatory diagrams il- 
lustrating table-format data storage techniques in the 

10 prior art. 

[0039] Fig. 3 is an explanatory diagram illustrating the 
principle of the present invention. 
[0040] Fig. 4 is an explanatory diagram illustrating an 
information block according to the present invention. 

15 [0041] Fig. 5 is an explanatory diagram illustrating an 
information block according to the present invention. 
[0042] Fig. 6 is an explanatory diagram illustrating an 
information block regarding "sex" used in an embodi- 
ment of the present invention. 

20 [0043] Fig. 7 is an explanatory diagram illustrating an 
information block regarding "age" used in an embodi- 
ment of the present invention. 

[0044] Fig. 8 is an explanatory diagram illustrating an 
information block regarding "sex" used in an embodi- 
es ment of the present invention. 

[0045] Fig. 9 is a flowchart of the operation of the 
method of searching within a single field according to 
Embodiment 1 of the present invention. 
[0046] Fig. 10 is an explanatory diagram illustrating 
30 an information block according to Embodiment 1 of the 
present invention. 

[0047] Fig. 11 is an explanatory diagram illustrating 
an information block according to Embodiment 1 of the 
present invention. 
35 [0048] Fig. 12 is a flowchart of the operation of the 
method of searching upon an AND of multiple fields ac- 
cording to Embodiment 2 of the present invention. 
[0049] Fig. 13 is an explanatory diagram illustrating 
an information block according to Embodiment 2 of the 
40 present invention. 

[0050] Fig. 14 is an explanatory diagram illustrating 
an information block according to Embodiment 2 of the 
present invention. 

[0051] Fig. 15 is an explanatory diagram illustrating 
45 the method of multiple-field Boolean operation search- 
ing using bit flags according to Embodiment 3 of the 
present invention. 

[0052] Fig. 16 is an explanatory diagram illustrating 
the method of multiple-field Boolean operation search- 
so ing using bit flags according to Embodiment 3 of the 
present invention. 

[0053] Fig. 17 is a flowchart of the operation of the 
method of tabulating according to Embodiment 5 of the 
present invention. 
55 [0054] Fig. 1 8 is a conceptual explanatory diagram of 
Embodiment 6 of the present invention. 
[0055] Fig. 19 is a flowchart of the operation of Em- 
bodiment 6 of the present invention. 
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[0056] Fig. 20 is a flowcliart of tlie operation of cross- 
tabulating according to Embodiment 6 of the present in- 
vention. 

[0057] Fig. 21 is an explanatory diagram illustrating 
an information block according to Embodiment 8 of the 
present invention. 

[0058] Fig. 22 is a flowchart of the operation of cross- 
tabulating according to Embodiment 9 of the present in- 
vention. 

[0059] Figs. 23A and 23B are conceptual explanatory 
diagrams of a cross-tabulation table. 
[0060] Fig. 24 is an explanatory diagram illustrating 
multi-answer type fields. 

[0061] Fig. 25 is an explanatory diagram illustrating 
an information block of a type compatible with multi-an- 
swer type fields according to Embodiment 10 of the 
present invention. 

[0062] Fig. 26 is an explanatory diagram illustrating 
the method of handling special values according to Em- 
bodiment 11 of the present invention. 
[0063] Fig. 27 is a flowchart of the operation of the 
method of searching upon multiple fields according to 
Embodiment 12 of the present invention . 
[0064] Fig. 28 is a structural diagram of a searching 
and tabulating system for table-format data based on 
one embodiment of the present invention. 
[0065] Fig. 29 is an explanatory diagram illustrating 
the method of constructing an information block. 
[0066] Fig. 30 is an explanatory diagram illustrating 
the preparation for data population and initialization. 
[0067] Fig. 31 is an explanatory diagram illustrating 
the first pass of data population. 
[0068] Fig. 32 is an explanatory diagram illustrating 
the second pass of data population. 
[0069] Fig. 33 is an explanatory diagram illustrating 
the third pass of data population. 
[0070] Fig. 34 is an explanatory diagram illustrating 
the third pass of data population. 
[0071] Fig. 35 is an explanatory diagram illustrating 
the third pass of data population. 
[0072] Fig. 36 is an explanatory diagram illustrating 
the addition of data to an information block. 
[0073] Fig. 37 is a diagram illustrating the structure of 
an information block according to another embodiment 
of the present invention. 

[0074] Fig. 38 is an explanatory diagram illustrating 
the initial state of sorting according to Embodiment 13 
of the present invention. 

[0075] Fig. 39 is an explanatory diagram illustrating 
the first step of sorting according to Embodiment 13 of 
the present invention. 

[0076] Fig. 40 is an explanatory diagram illustrating 
the second step of sorting according to Embodiment 13 
of the present invention. 

[0077] Fig. 41 is an explanatory diagram illustrating 
the final state of sorting according to Embodiment 13 of 
the present invention. 

[0078] Fig. 42 is an explanatory diagram illustrating 



sorting on a partial set. 

[0079] Fig. 43 is an explanatory diagram illustrating 
the post-processing of sorting on a partial set. 
[0080] Fig. 44 is an explanatory diagram illustrating 
5 the 1 million records of data used in the searching and 
tabulating tests. 

[0081] Fig. 45 is an explanatory diagram illustrating 
the results of measurement of the searching and tabu- 
lating tests on 1 million records of data. 

10 [0082] Figs. 46A and 46B arc flowcharts illustrating 
the OR search process on multiple fields as a variation 
of Embodiment 2 of the present invention. 
[0083] Fig. 47 is a flowchart illustrating the searching 
process according to Embodiment 3 of the present in- 

15 vention. 

[0084] Fig. 48 is a flowchart illustrating the tabulating 
process according to Embodiment 4 of the present in- 
vention. 

[0085] Fig. 49 is a flowchart illustrating the tabulating 
20 process according to Embodiment 7 of the present in- 
vention. 

[0086] Fig. 50 is a flowchart illustrating the sorting 
process according to Embodiment 13 of the present in- 
vention. 

25 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

[0087] In order for the present invention to be better 
understood, we shall use the table-format data illustrat- 
30 ed in Fig. 1 as an example of data, and make a detailed 
description of the search method, tabulating method 
and sorting method according to the present invention 
in various embodiments. The data illustrated in the ex- 
ample of Fig. 1 includes the fields of "sex," "age" and 
35 "occupation," so as shown in the individual figures in 
Figs. 6-8, the information blocks obtained are an infor- 
mation block regarding "sex," an information block re- 
garding "age" and an information block regarding "oc- 
cupation." The following description assumes a situa- 
40 tion wherein these information blocks are obtained. 
Note that while one technique of constructing the infor- 
mation blocks will be described later, note that the 
present invention is in no way limited by the method of 
constructing the information blocks. 
45 [0088] As described later, the apparatus for searching 
for and tabulating table-format data according to an em- 
bodiment of the present invention is provided with the 
structure shown in Fig. 28. As shown in Fig. 28, the ap- 
paratus for searching for and tabulating table-format da- 
50 ta is implemented by means of a computer system such 
as an ordinary personal computer. This computer sys- 
tem includes a CPU 1 00 that executes programs to con- 
trol the entire system and its individual constituent com- 
ponents, ROM (Read Only Memory) 1 1 0 that stores pro- 
55 grams and the like, RAM (Random Access Memory) 1 20 
that stores working data and the like, a hard disk storage 
device 130, a display device 140, and a keyboard, 
mouse or other input device 150. The CPU 100, ROM 
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1 1 0, RAM 1 20, and the like are connected to each other 
via a bus 1 60. Other components that may also be con- 
nected to the bus include a CD-ROM drive (not shown) 
for accessing CD-ROM discs, an external network (not 
shown) and an interface (not shown) provided to con- 
nected external terminals, and the like. 
[0089] The program that performs the searching and 
tabulating (and also depending on the case, sorting) of 
table-format data may be contained on CD-ROM (not 
shown) and read by a CD-ROM drive (not shown), or 
stored in advance in ROM 110. In addition, once read 
from CD-ROM, the program may also be stored in aspe- 
cific area of the hard disk storage device 1 30. Alternate- 
ly, the aforementioned program may also be supplied 
from outside via the network, external terminals or inter- 
face (none of these are shown). 
[0090] In addition, in the aforementioned search and 
tabulating apparatus, in order to execute searching and 
tabulating (and also depending on the case, sorting) 
processes on table-format data, as described later, it is 
necessary to generate an information block of a stipu- 
lated data format based on the table-format data. This 
information block generation program may be similarly 
contained on CD-ROM, stored in ROM 110, or stored 
on the hard disk storage device 130. Alternately, the 
aforementioned programs may also be supplied from 
outside via the network, external terminals or interface 
(none of these are shown). In addition, in this embodi- 
ment, the data ( information blocks) generated by the 
aforementioned information block generating program 
that generates the information blocks are stored in RAM 
1 20 or in a specific area of the hard disk storage device 
1 30 . Here follows a description of the method of search- 
ing on a single field according to Embodiment 1 of the 
present invention, in the case of searching for records 
wherein the value of the "age" field is "1 6" or "1 9." Fig. 
9 is a flowchart of the operation of the method of search- 
ing within a single field. This is implemented by the CPU 
100 executing the search program acquired by the 
aforementioned procedure and stored in a stipulated ar- 
ea. 

[0091] First, from among the information blocks re- 
garding table-format data, select the information block 
regarding "age" shown in Fig. 7 as the specific informa- 
tion block (Step 100). 

[0092] Next, set "1" in the category number of those 

rows in which the field value within the value control ta- 
ble of the specific information block matches "16" or "1 9" 
which is the aforementioned search condition, and set 
"0" in the category number of other rows (Step 102). In 
the case of this example as shown in Fig. 10, "1" is set 
in the category number of those rows corresponding to 
a field value number of "0" and field value number of "3." 
[0093] Next, the start positions and counts corre- 
sponding to the rows wherein the category number is 
set to "1 " (namely, the rows to which the field value num- 
bers of "0" and "3" are applied) are acquired as pointer 
extraction information (Step 1 04). In the case of this ex- 



ample, the field value number of "0" has acorresponding 
start position of "0" and count of "45898." On the other 
hand, the field value number of "3" has a corresponding 
start position of "238137" and count of "189653." 

5 [0094] By extracting from the array of pointers to 
records the number of pointers specified by the afore- 
mentioned start position and count, the record numbers 
that represents pointers to the records matching the 
aforementioned search conditions are extracted (Step 

10 106). In the case of this example, as shown in Fig. 10, 
one can see that the pointers to records corresponding 
to the field value number of "0" are stored in the array 
of pointers to records at locations from the start position 
of "0," or namely the beginning, up until the 45898^^ |o- 

15 cation, while the pointers to records corresponding to 
the field value number of "3" are stored in the array of 
pointers to records at 1 89653 locations starting from the 
23831 37^^^ location. For example, when accessing the 
table-format data in Fig. 1, the "age" corresponding to 

20 the record with the last record number of "999999" is 
"1 6," so as shown in Fig. 1 1 , the last pointer among the 
stored pointers within the array of record pointers which 
correspond to a field value number of "0," or namely an 
"age" of "16," is "999999." 

25 [0095] Finally, in order to be used in subsequent 
processing, an array of the extracted record numbers is 
created as a result set and saved (Step 1 08). 
[0096] With the present invention, it is possible to im- 
plement not only searches on a single field as described 

30 above, but also searches on an AND of multiple fields. 
Here follows a description of the method of performing 
searches on an AND of multiple fields according to Em- 
bodiment 2 of the present invention. In this example, we 
shall consider the case of obtaining a set of records that 

35 satisfy both the first search condition of the "age" being 
"1 6" or "1 9" and the second search condition of the "oc- 
cupation" being "Student." Fig. 12 is a flowchart of the 
operation of the method of searching upon an AND of 
multiple fields. 

40 [0097] As described previously, for the first specific in- 
formation block which is the information block regarding 
"age" which is the first field, a result set of records 
wherein the "age" is "16" or "19" is obtained by means 
of the processing according to Embodiment 1 (Step 

45 120). Therefore, the processing of this Step 120 corre- 
sponds roughly to that shown in Fig. 9. 
[0098] Next, the information block regarding "occupa- 
tion" which is the second field shown in Fig. 8 is selected 
as the second specific information block (Step 122). 

50 [0099] Next, set "1" in the category number of those 
rows in which the field value within the value control ta- 
ble of the specific information block matches "Student" 
which is the aforementioned search condition, and set 
"0" in the category number of other rows (Step 124). In 

55 the case of this example as shown in Fig. 13, "1 " is set 
in the category number of those rows corresponding to 
a field value number of "0," and "0" is set in other rows. 
[0100] Next, sequentially extract from the result set 
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for the first search condition those record numbers that 
represent pointers to records (Step 126). For example, 
as shown in Fig. 14, the record number "999999" is ex- 
tracted. 

[0101] Next, regarding the second specific informa- 
tion block, extract from the array of pointers to the value 
control table the field value numbers corresponding to 
the record number obtained with respect to the afore- 
mentioned first search condition (Step 128). For exam- 
ple, as shown in Fig. 14, the field value number of "0" 
corresponding to the record number of "999999" is ex- 
tracted. 

[0102] Next, a decision is made as to whether or not 
"1" is set in the category number corresponding to the 
field value number extracted with respect to the second 
specific information block (Step 130). For example, as 
shown in Fig. 14, one can see that "1" is set in the cat- 
egory number corresponding to the field value number 
of "0." 

[0103] In the case that "1" is set in the category 
number, add pointers to records corresponding to loca- 
tions within the array of pointers to the value control ta- 
ble where pointers indicating the field value number in 
question where "1 " is set in the category number, for ex- 
ample, record numbers, to the final result set (Step 1 32). 
For example, as shown in Fig. 14, the record number 
"999999" is added to the final result set. 
[01 04] I n the case that the category number is "0" then 
the final result set is not updated. 
[0105] Note that as would be easily understandable 
to a person skilled in the art, the aforementioned method 
of searching upon an AND of multiple fields can be ap- 
plied to searches other than AND searches, so such var- 
iations as a method of searching upon an OR of multiple 
fields, for example, would be possible. Fig. 46A is a flow- 
chart illustrating one example of the processing of an 
OR search process on multiple fields. This process is 
also implemented by the CPU 1 GO executing a program 
stored in a stipulated area. As shown in Fig. 46, first, 
after the result set is obtained with respect to the first 
search condition (Step 4601), an information block for 
the second search condition is selected (Step 4602). 
Next, regarding this information block, a category 
number is set with respect to the second search condi- 
tion (Step 4603). While skipping record numbers con- 
tained in the result set from the first search condition, 
the array of pointers to the value control table is scanned 
sequentially with respect to the second specific informa- 
tion block (Step 4604). In more detail, the record num- 
bers wherein the category number was made "1" re- 
garding the second search condition, and a decision is 
made as to whether or notthis record number was found 
within the result set according to the first search condi- 
tion (Steps 461 1 -461 5). If the number is not found within 
the result set according to the first search condition, that 
number is added to the result set (Step 461 4). After this 
process is complete, a second result set is generated 
by combining the record numbers stored in the result 



set from the first search condition and the record num- 
bers belonging to the field value numbers for which the 
category number is set with respect to the second infor- 
mation block (Step 4615), and this can be provided as 

5 output. 

[0106] Alternately, the process shown in Fig. 46B may 
be executed. In this example, after the first result set is 
obtained based on the search conditions regarding the 
first specific information, independently thereof, a sec- 

10 ond result set is obtained based on the second search 
conditions regarding the second specific information 
(Steps 4621 -4624), and an OR of the first result set and 
second result set (Step 4625) is performed using a bit- 
map (Step 4626), and a new result set is created based 

15 on this (Step 4627). Note that in the process of Fig. 46B, 
steps 4602 and 4603 correspond to Steps 4621 and 
4622 of Fig. 46A, and step 4625 corresponds to Step 
4601 of Fig. 46A. 

[0107] Figs. 15 and 16 are explanatory diagrams il- 

20 lustrating the method of multiple-field Boolean operation 
searching using bit flags according to Embodiment 3 of 
the present invention, illustrating the case of performing 
a search under the same search conditions as the 
search according to the aforementioned Embodiment 2 

25 of the present invention. Multiple-field Boolean opera- 
tion searching using bit flags is defined to mean a search 
wherein the search conditions are expressed by a 
Boolean operation among search conditions for each 
field. In this case, as in Embodiment 1, a result set ob- 

30 tained by means of a search on a single field should not 
be constructed of an array of record numbers but rather 
it is more advantageous for the result set to be construct- 
ed in the form of bit flags. Namely, in accordance with 
the process illustrated in Fig. 47, the result set is gen- 

35 erated by allocating one bit each to all of the records, 
and a bit value of "1" or "0" expresses whether or not 
each record matches the search conditions. In more de- 
tail, in the same manner as in the other embodiments, 
an information block containing field values pertaining 

40 to the search condition is selected (Step 4701 ), and then 
the category number is set to "1 " on rows that match the 
search conditions (Step 4702). Next, the corresponding 
category number is accessed for each record and the 
bit value to be stored in the result set is determined 

45 (Steps 4703-4707). By forming the result set in this man- 
ner, the size of the result set for each field corresponds 
to the number of records in the table-format data, so the 
size of the result set is identical for each field, and as a 
result, it is simple to perform Boolean operations, e.g., 

50 AND, OR and XOR, on elements in the result set. 

[0108] In this example, the result set A shown in Fig. 
1 5 and the result set B shown in Fig. 1 6 are joined under 
AND conditions to obtain the desired search result set 
in bit flag format. In addition, the search result set in bit 

55 flag format thus obtained can be converted to a result 
set in the format of an array of pointers to records, and 
thus combined with the aforementioned method of 
searching on multiple fields according to Embodiment 2 
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of the present invention. 

[0109] Next, we shall add an explanation of the meth- 
od of tabulating various types of table-format data ac- 
cording to the present invention. The tabulating method 
according to Embodiment 4 of the present invention 
comprises counting the number of records that have a 
specific field value in a specific field. In Embodiment 4 
of the present invention, we shall consider the case of 
counting the number of records that have the field value 
of "Male" or that have the field value of "Female" in the 
"sex" field. As illustrated in Fig. 6, according to a pre- 
ferred embodiment of the present invention, the infor- 
mation block regarding "sex" contains a count of the 
records that contain the field value of "Male" (its value 
being "632564") and a count of the records that contain 
the field value of "Female" (its value being "367436"), 
so a simple tabulation of the number of records can be 
obtained immediately by accessing the array of counts 
within the information block. 

[0110] In addition, by combining the category num- 
bers described regarding the method of searching on 
single fields with the method of tabulating according to 

the aforementioned Embodiment 4 of the present inven- 
tion, counting the number of records can be performed 
easily even in the case of more complicated conditions. 
For example, in the method of searching for records in 
which the value of the "age" field is "16" or "19" de- 
scribed in Embodiment 1 of the present invention, it is 
possible to tabulate the counts corresponding to field 
value numbers in which "1" is set as the category 
number, and thus tabulate the number of records match- 
ing the search conditions. In this manner, by using the 
category number, even in the case that the value control 
table is of a large size or in the case that complex con- 
ditions are given, the count can be found efficiently. 
More generally, as shown in Fig. 48, it is sufficient to find 
the field value numbers in which "1" is set in the category 
number (Step 4802) and then add the corresponding 
counts (Step 4803). 

[0111] Next, we shall provide additional explanation 
of Embodiment 5 of the present invention. In this em- 
bodiment, we shall calculate the average age of the 
"Males." The average age can be calculated by the for- 
mula (total "age" of the males)/(count of males), and al- 
so the count of males can be obtained by the tabulation 
method described in the aforementioned Embodiment 
4 of the present invention. Therefore, in this embodi- 
ment, it comes down to the problem of finding the total 
"age" of the males. Fig. 1 7 is a flowchart illustrating the 
operation of Embodiment 5 of the present invention. In 
the same manner as in other embodiments, t his proc- 
ess is also implemented by the CPU 100 executing a 
program stored in a stipulated area. 
[0112] First, the information block regarding "sex" as 
shown in Fig. 6 is selected as the first information block 
(Step 140), and the field value numbers of "0" corre- 
sponding to the field value of "Male" are detected from 
within the value control table of the specific information 



block (Step 142). Next, the count corresponding to the 
field value number of "0" is "632564" so the total number 
of males is determined to be 632564, and also, the start 
position corresponding to the field value number of "0" 

5 is "0" so the pointers to records wherein the sex is male 
arc determined to be stored in the locations starting from 
the beginning until the 632564 location, and thus a list 
of the pointers to these records, namely, an array of 
record numbers is kept as the result set (Step 146). 

10 [0113] Next, the information block regarding "age" il- 
lustrated in Fig. 7 is selected as the second specific in- 
formation block (Step 1 48) and from the array of pointers 
to the value control table of the second specific informa- 
tion block, the field value number corresponding to the 

15 record specified in the result set regarding the first spe- 
cific information block is extracted (Step 150), and the 
field value related to the extracted field value number, 
namely the "age" is extracted (Step 152). Finally, find 
the total age by sequentially adding the extracted "age" 

20 values (Step 154), and repeat steps 150, 152 and 154 
until all of the specified records in the aforementioned 
result set are processed (Step 156). The total age thus 
obtained is divided by the count to find the average age 
(Step 158). 

25 [0114] Next, we shall add an explanation of Embodi- 
ment 6 of the present invention. In this example, we find 
the average age of the male students and the average 
age of female students. Fig. 1 8 is a conceptual explan- 
atory diagram of Embodiment 6 of the present invention, 

30 while Fig. 1 9 is a flowchart of the operation of Embodi- 
ment 6 of the present invention. 
[0115] In this embodiment, the tabulation is per- 
formed by first selecting the information block regarding 
"occupation" as the first information block (Step 170), 

35 and using the search condition of "occupation is stu- 
dent" to create from among all records a result set con- 
taining the records wherein the "occupation is student" 
(Step 172). 

[0116] Next, select the information block regarding 
40 "sex" as the second information block and also select 
the information block regarding "age" as the third infor- 
mation block (Step 1 74), and sequentially extract point- 
ers to records from the beginning of the result set (Step 
176). 

45 [Oil 7] Using the extracted pointers to records, the ar- 
ray of pointers to the value control table of the second 
information block is accessed to get the sex correspond- 
ing to the extracted pointers to records, and also, the 
array of pointers to the value control table of the third 

50 information block is accessed to extract the age corre- 
sponding to the extracted pointers to records (Step 1 78). 
Totals for both male and female sexes are incremented 
by 1 for each extraction, to calculate the total extracted 
age for both males and females (Step 180). 

55 [0118] A check is made as to whether or not all point- 
ers to records of the result set have been processed 
(Step 1 82), and if all pointers to records have been proc- 
essed, the average ages for both male and female stu- 



11 



21 



EP1 136 918 A1 



22 



dents is calculated by dividing the total ages for males 
and females by the total number (Step 184). 
[0119] Next, we shall add an explanation of Embodi- 
ment 7 of the present invention made with reference to 
the flowchart of operation illustrated in Fig. 20. In this 
embodiment, the so-called cross-tabulation is imple- 
mented. Note that the processing program shown in Fig. 
20 is also read and executed by the CPU 100. In this 
embodiment, we shall consider the case of finding 
counts by sex/by occupation taking the entire set of 
records regarding the table-format data of Fig. 1 . 
[01 20] Regarding the two fields of "sex" and "occupa- 
tion" used in tabulation, the respective value control ta- 
bles and the field value number specifying information 
array, namely, the array of pointers to the value control 
table, which express two pieces of individual field infor- 
mation, namely the first and second information blocks 
are kept in a storage device (Step 190). The memory 
device may be implemented in the form of, for example, 
memory, virtual storage, memory-mapped file or the 
like. 

[0121] Regarding the first information block regarding 
sex, as shown in Fig. 6, the total number of field value 
numbers for sex is "2," and regarding the second infor- 
mation block regarding occupation, as shown in Fig. 8, 
one can see that the total number of field value numbers 
for occupation is "4." Thus, an initialized 2x4 (2 row by 
4 column) two-dimensional array is generated as the 
space to store tabulation data (Step 192). 
[0122] Regarding the first and second information 
blocks, the field value numbers q ^ and respectively, 
are extracted sequentially from the beginning of the ar- 
ray of pointers to the value control table, and these are 
used to identify a single element P(q^, q2) in the two- 
dimensional array (Step 1 94), and then the value of the 
identified element P(q-,, q2) is incremented by 1 (Step 
196). 

[0123] A check is made as to whether or not all field 

value numbers (namely, a number of field value num- 
bers equal to the total number of records) have been 
extracted from the array of pointers to the value control 
table (Step 1 98), and if field value numbers still remain, 
then return to Step 194, but if not, terminate the tabula- 
tion process. This completes the two-dimensional cross 
table. 

[0124] In the aforementioned Embodiment 7 of the 

present invention, tabulation is performed on the entire 
set of records in the table-format data of Fig. 1 , but it is 
also possible to perform the same type of tabulation on 
a partial set of records, for example, tabulating a count 
of 1 6 year olds by sex/occupation. In order to do this, a 
single-field search is first performed using the age of 1 6 
as the search condition, and then the identifying infor- 
mation for records that match an age of 1 6 is acquired 
and kept. Next, as described previously with regard to 
searches on an AND of multiple fields, it is sufficient to 
take the records contained in this result set, namely the 
partial set of records as the object of the operation, and 



sequentially extract record identifying information con- 
tained in the result set starting from the beginning, and 
then extract the field value numbers corresponding to 
the record identifying information from the array of point- 

5 ers to the value control table, and take the field value 
numbers thus obtained as the row and column coordi- 
nates of a two-dimensional array while incrementing the 
count of element values by "1 ." Namely, in this process, 
as shown in Fig. 49, a process roughly equivalent to that 

10 shown in Fig. 9 is executed to generate a result set for 
containing pointers to records (Step 4901 ). Next, this re- 
sult set is used to execute a process roughly equivalent 
to that of Fig. 20. Here, first, in the same manner as in 
Fig. 20, the first and second information blocks regard- 

15 ing the fields used in tabulation (Step 4902), and next, 
an initialized two-dimensional array is generated (Step 
4903). 

[0125] Thereafter, a number that indicates the storage 
position of pointers in the result set (hereinafter referred 

20 to as the "storage position number" depending on the 
case) is initialized (Step 4904). In accordance with the 
program, regarding the various storage position num- 
bers, the CPU 100 extracts an array of pointers corre- 
sponding to the value control table in the first and sec- 

25 ond information block and identifies an element P(q-|, 
q2) within the two-dimensional (Step 4905), and next, 
the P(q-|, q2) is incremented (Step 4906). By executing 
this process with respect to all storage position numbers 
(namely, by processing the aforementioned field value 

30 numbers within the result set), it is possible to achieve 
cross-tabulation regarding a partial set obtained by 
searching or the like. 

[0126] Next, we shall add an explanation of Embodi- 
ment 8 of the present invention. In this example, cross- 
es tabulation is implemented in the situation wherein the 
field values of the field are divided into several catego- 
ries, by counting the counts for each category of field 
value. For example, referring to the information block 
regarding "occupation" shown in Fig. 8, one sees that 
40 the four field values of "student," "programmer," "teach- 
er" and "other" are registered for "occupation." As the 
categories based on these field values, one can envi- 
sion the case of recategorization into the three types of 
"income earner," "non-income earner" and "unknown." 
45 in this example, in this situation, a new category of 
"presence of income" is created to create a cross-tabu- 
lation of counts depending on sex/presence of income. 
[0127] The information block regarding "occupation" 
shown in Fig. 21 includes a value control table wherein 
50 category numbers are applied to each field value 
number based on the "presence of income" in particular. 
In this example, students are assigned a category 
number of "1 " (non-income earner), while programmers 
and students arc assigned a category number of "0" (in- 
55 come earner), and "other" is assigned a category 
number of "2" (unknown). 

[0128] As a general rule, the cross-tabulation in Em- 
bodiment 8 of the present invention has a process se- 
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quence roughly the same as that of the cross-tabulation 
in Embodiment 7, but it differs in the point that it uses 
as the coordinates that specify the element of the two- 
dimensional array to store the tabulation data, the field 
value number of the first information block regarding sex 
and the field value number of the second information 
block regarding occupation. 

[0129] Since any of the field value numbers or cate- 
gory numbers can be used as the coordinates of ele- 
ments in the two-dimensional array, in Embodiment 8 of 
the present invention, with respect to the first and sec- 
ond information blocks, the respective field value num- 
bers stored in each array of pointers to the value control 
table are extracted sequentially, and the coordinates of 
the element P in the two-dimensional array is identified 
based on either the field value number itself extracted 
from the array of pointers to the value control table or 
the category number stored in the value control table 
corresponding to the field value number. 
[0130] In the example described above, the informa- 
tion block on "sex" is used as the first information block, 
and the information block on "occupation" is used as the 
second information block (see Step 190 of Fig. 20). In 
the next processing step (Step 191), since the informa- 
tion block on "sex" contains two field value numbers and 
the information block on "occupation" contains three 
category numbers, an initialized 2 x3 (2 row by 3 col- 
umn) two-dimensional array is generated. 
[0131] In the subsequent processing steps also, the 
field value number q ^ of the first block and the category 
number q 2 of the second block are extracted, so these 
are used to identify a single element P(q-|, qg) and then 
the value of the element P is incremented (see Steps 
194 and 196). 

[0132] While the cross-tabulation according to Em- 
bodiments 7 and 8 described above is particularly tab- 
ulation in the form of finding counts, but note that the 
present invention may also be expanded to cross-tabu- 
lation wherein an average age is found depending on 
multiple fields (e.g., by sex/by occupation). In Embodi- 
ment 9, cross-tabulation of the aforementioned type is 
performed. 

[0133] In order for the tabulation method of a type 
wherein counts are found by incrementing the elements 
of a two-dimensional array one at a time, as in the afore- 
mentioned Embodiment 7 of the present invention, to 
be expanded to cross-tabulation of a type that requires 
operations other than the totaling of counts such as find- 
ing an average age, the variation shown in the operation 
flow chart in Fig. 22 is adopted. 
[0134] To wit, according to Embodiment 9 of the 
present invention, 2 two-dimensional arrays are used for 
tabulation, so regarding the first two-dimensional array, 
the counts by sex/by occupation are counted in the 
same manner as the aforementioned Embodiment 7, 
and regarding the second two-dimensional array, the to- 
tal age by sex/by occupation are calculated. 
[0135] Here follows a more detailed explanation. 



[01 36] First, a first, second and third information block 
for the three fields of sex, occupation and age are loaded 
into the storage device (Step 200). 
[0137] Corresponding to the total number of field val- 
5 ue numbers for sex and occupation of "2" and "4" re- 
spectively, an initialized 2X4 (2 row by 4 column) two- 
dimensional array for storing tabulation data is created 
(Step 202). 

[0138] Starting from the beginning of the array of 
10 pointers to the value control table of the first and second 
information blocks, the field value numbers q^, q2 and 
qg are extracted sequentially to identify the coordinates 
(q-i , q2) of an element of the two-dimensional array (Step 
204) and then the value of the element P^ (q^ , q2) of the 
15 first two-dimensional array thus identified is increment- 
ed by 1 each (Steps 206). 

[0139] Moreover, with respect to the information block 

regarding "age," the field value corresponding to the 
field value number of qg (namely, the age) is acquired 
20 (Step 208), and the acquired age is added to the ele- 
ment P 2(qi , q2) of the identified second two-dimension- 
al array (Step 210). 

[0140] After this processing, a check is made as to 
whether or not all subject records have been processed 

25 (Step 212), and if not, then control returns to Step 204, 
but if so, then the operation P2(qi , ^2V^^i^^ ' ^^2) P^^~ 
formed among the various elements of two-dimensional 
array P^ and two-dimensional array P2 (Step 206). 
Thereby, the average age by sex/by occupation is ob- 

30 tained and a cross-tabulation table of averages is cre- 
ated. 

[0141] Fig. 23 A is a conceptual explanatory diagram 
of a cross-tabulation table obtained in the aforemen- 
tioned Embodiment 7 of the present invention. In this 

35 manner, in the aforementioned Embodiment 7, counts 
for all combinations of sex/occupation are tabulated. 
However, as shown in Fig. 23B, among the by sex/by 
occupation categories, there may be cases wherein one 
wishes to know in particular the count of only those per- 

40 sons having a sex of female and occupation of student. 
By means of the present invention, the count in this case 
is obtained by finding the size of the result set from a 
search of an AND of the multiple fields of "female" AND 
"student." 

45 [0142] Similarly, in the aforementioned Embodiment 
9 of the present invention, a cross-tabulation table of the 

average age is found for all combinations of sex/occu- 
pation, but it is also possible to find in particular the av- 
erage age of only those persons having a sex of female 

50 and occupation of student . In this case, the count is 
found from the size of the result set from a search of an 
AND of the multiple fields of "female" AND "student," 
and the total of ages is found by adding the ages be- 
longing to records specified by the identifying informa- 

55 tion for records contained in the result set, and by cal- 
culating the fraction (total of ages)/(count), it is possible 
to find the desired value (e.g., the average age) regard- 
ing a specific cell in the cross-tabulation table for aver- 
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age age. 

[0143] Next, we shall add an explanation of Embodi- 
ment 10 of the present invention. Fig. 24 is a diagram 
illustrating multi-answer type fields, while Fig. 25 is an 
explanatory diagram of an information block of a type 
compatible with multi-answer type fields according to 
Embodiment 1 0 of the present invention. "Multi-answer" 
refers to the situation wherein, for example, when an- 
swers are obtained to the question "What kinds of writ- 
ing implements are now on the table? " then multiple 
answers such as "pencil, eraser" or "paper, pencil" are 
obtained from the same person. To wit, in the case of 
multi-answer, it is possible to specify multiple field val- 
ues for a single field of a single record. Fig. 24 shows a 
list of the responses to the aforementioned question ob- 
tained from 1 million people, given as is. 
[0144] In orderto process such data, by means of Em- 
bodiment 1 0 of the present invention, as shown in Fig. 
25, the array of pointers to the value control table of the 
information block differs from the array of field value 
numbers itself as described above, but rather 1 bit is 
allocated to each field value number in the pointers in 
the array. Therefore, it is possible to indicate whether or 
not a record specifies that field value number by means 
of turning bits on/off (namely, a binary number). There- 
by, it is possible to specify multiple field values contained 
in a single field in a single record. For example, in Fig. 
25, the pointers (bit pointers) within the array of pointers 
are 4-bit in size, and when the highest bit is on (namely, 
"1 "), this means that the response of "paper" is included, 
when the second bit is on the response of "ruler" is in- 
cluded, and when the third bit is on this means the re- 
sponse of "eraser" is included. Moreover, when the low- 
est bit is on, this means the response of "pencil" is in- 
cluded. 

[0145] The pointer corresponding to record number 
"0" has the value "3." This can be considered to be 
"21+20." Therefore, this can be understood as the re- 
sponses of "pencil" and "eraser" being included corre- 
sponding to this record number. In addition, the pointers 
corresponding to record number "1 " and record number 
"2" have the values "4" and "1 0," respectively, and these 
can be considered to be "2^" and "2^+2''." Therefore, 
one can thus know that the responses corresponding to 
these record numbers include "ruler" along with "eraser" 
and "paper," respectively. 

[0146] By means of this embodiment, each bit in the 
pointer value is given meaning so a plurality of field val- 
ue numbers can be indicated. Therefore, even in the 
case in which a record has a plurality of field values, this 
can be expressed by means of the pointer value. 
[0147] Note that the present invention has an advan- 
tage in that it can be easily adapted to a multi-answer 
situation by simply modifying the constitution of one por- 
tion of the information block. In fact, an information block 
thus modified can be used to replace the information 
blocks adopted in the various aforementioned embodi- 
ments of the present invention. 



[0148] Next, we shall add an explanation of Embodi- 
ment 11 of the present invention. Fig. 26 is an explana- 
tory diagram illustrating the method of handling blanks, 
error values and other special values that occur during 

5 tabulation processing, according to Embodiment 11 of 
the present invention. As shown in this figure, in Embod- 
iment 11, cross-tabulation is executed by taking blanks 
to be one category. When handling actual data, there 
may be cases in which blanks or log(-1 ) or other math- 

10 ematical errors appear. With the present invention, even 
if such special values (blanks, errors, etc.) are present, 
it has the advantage in that they are registered in the 
value control table as field values, and the registered 
special values can be used as is as categories for 

15 searches or tabulation. 

[0149] Next, we shall add an explanation of Embodi- 
ment 1 2 of the present invention. We shall describe de- 
lay evaluation using the flowchart of the operation of the 
method of searching upon multiple fields according to 

20 Embodiment 12 of the present invention shown in Fig. 
27. In this embodiment, in the same manner as in Em- 
bodiment 2 of the present invention, we shall consider 
the case of obtaining a set of records that satisfy both 
the first search condition of the "age" being "16" or "19" 

25 and the second search condition of the "occupation" be- 
ing "Student." 

[0150] In the aforementioned Embodiment 2 of the 
present invention, the category numbers for all records 
are set in advance for all records (Step 124 of Fig. 12), 

30 but in the case of Embodiment 1 2, the setting of the cat- 
egory numbers is performed for only the category num- 
bers corresponding to the field value numbers actually 
accessed based on the result set from the search on the 
first search condition. 

35 [0151] As described previously, from the first specific 
information block which is the information block regard- 
ing "age" which is the first field, a result set of records 
wherein the "age" is "16" or "19" is obtained according 
to Embodiment 1 of the present invention (Step 220). 

40 [0152] Next, the second specific information block 
which is the information block regarding "occupation" 
which is the second field shown in Fig. 8 is selected 
(Step 222), and the value of all category numbers in the 
value control table of the second specific information 

45 block is initialized to "-1 " for example (Step 224). 

[0153] Next, from the result set regarding the first 
search condition, record numbers that represent point- 
ers to records are extracted sequentially (Step 226). In 
the case of this example, as shown in Fig. 1 4, the record 

50 number "999999" is extracted, for example. 

[0154] Next, regarding the second specific informa- 
tion block, extract from the array of pointers to the value 
control table the field value number corresponding to the 
record number obtained under for the aforementioned 

55 first search condition (Step 228). In the case of this ex- 
ample, as shown in Fig. 14, the field value number of 
"0" corresponding to the record number of "999999" is 
extracted, for example. 
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[0155] Next, a check is made as to whether the value 
of the category number corresponding to the field value 
number extracted with respect to the second specific in- 
formation block is or not (Step 230). 
[0156] In the case that the category number is "-1," 
this means that the category number has not yet been 
set for that field value number, so a decision is made as 
to whether or not the field value corresponding to this 
field value number matches the aforementioned second 
search condition (Step 232), and if it matches then the 
category number is set to "1" (Step 234), but if it does 
not match, then the category number is set to "0" (Step 
236). 

[0157] In the case that the category number is not "- 
1 ," then a decision is made as to whether or not the value 
of the category number corresponding to the field value 
number extracted above is set to "1" (Step 238). If the 
value of the category number is set to "1 ," then add to 
the final result set a pointer to the record, e.g. the record 
number, corresponding to the location within the array 
of pointers to the value control table at which is stored 
a pointer which indicates the field value number in which 
the category number is set to "1 " (Step 240). In this ex- 
ample, as shown in Fig. 14, record number "999999" for 
example, is added to the final result set. If the category 
number is "0" then the final result set is not updated. 
[0158] The delay evaluation as shown in the afore- 
mentioned Embodiment 12 is effective in the following 
types of cases. For example, consider the case in which 
a customer database of 1 million people exists and one 
wishes to implement a telephone survey, and thus ex- 
tract a sample of 100 people. For example, when the 
people are narrowed down to those who satisfy stipu- 
lated conditions (sex, age, occupation, etc.) one can 
come up with 10,000 people, and then in order to ensure 
randomness, a search is performed based on the num- 
bers (e.g., "12") at the end of the telephone number. 
[0159] In this case, in Embodiment 12, first the ele- 
ments of the "category number array " arc filled with "- 
1" to evaluate only the aforementioned set of 10,000 
people. Namely, for the result set of a size of 10,000 
people, the elements of the category number array are 
accessed and if the element is "-1" then and only then 
the telephone number is accessed and the results of the 
access are given as elements of the "category number 
array." Thereby, it is possible to keep the number of 
checks down to 1 0,000. In this manner, by means of Em- 
bodiment 12, it is possible to reduce the number of 
processing steps greatly in comparison to an ordinary 
AND search. 

[0160] In addition, by using the information block ac- 
cording to the present invention, data that has a struc- 
ture like that of a telephone number consisting of the 
"country code + area code + central office code + 
number " can be divided and registered in multiple in- 
formation blocks, and this has an advantage in that 
searching and tabulation regarding a country code, area 
code or other partial data can be performed easily. 



[0161] In addition, by using the category number ac- 
cording to the embodiments of the present invention as 
described previously, it is possible to generate new cat- 
egories for ages, for example, by taking the ages 
5 "10-19" to be the "tens," the ages "20-29" to be the 
"twenties" and so on, and methods of searching and tab- 
ulating similar to those described above can be applied 
to the new categories thus generated. 
[0162] As described above, the apparatus for imple- 
menting searching and tabulating according to embod- 
iments of the present invention is implemented by 
means of an ordinary computer system shown in Fig. 

28, for example a personal computer including a CPU 
1 00, ROM 1 1 0, RAM 1 20, a hard disk 1 30, a display 1 40 
or other output device, and a keyboard/mouse 150 or 
other input device 150 connected to each other via a 
bus 160. Therefore, as described above, the program 
for constructing the information block for implementing 
the aforementioned embodiment (information block 
generating program) may also be recorded on CD- 
ROM, ROM 110 or the hard disk storage device 1 30, or 
may be supplied from outside via a network (not shown). 
[0163] With reference to the flowchart shown in Fig. 

29, here follows one example of the method of con- 
structing an information blockof the format shown in Fig. 
5 for the table-format data shown in Fig. 2B. 

Step 300: Data preparation 

[0164] First, data of the format shown in Fig. 2B is pre- 
pared. Next, this is divided by field. In Fig. 2B, it can be 
divided into the fields of "sex," "age" and "occupation." 

Step 311 : Generation of the information block for the 
"sex" field 

[0165] Generate one information block and this is to 
he the information block for the "sex" field, for example. 

Step 312: Generation of the value control table 

[0166] Next, initialize the value control table and scan 
the "sex" field data from the beginning to end, while 
counting the number of instances of each field and stor- 
ing this data. In the case of this example, up until this 
step, the field values of "female" and "male" are set in 
the array of field values 11 of the aforementioned value 
control table, while the values of "367436" and "632564" 
respectively are set in the array of counts 1 4 of the afore- 
mentioned value control table corresponding to the 
aforementioned field values. 

[0167] Next, the field values ("female" and "male") 
within the array of field values 11 are sorted according 
to a stipulated basis. Naturally, at the time of this sort, 
the array of counts 14 must also be reordered with the 
sorting of the array of field values 11 . 
[0168] Moreover, set the start position in the array of 
start positions 13 of the value control table. This start 
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position is found as tine cumulative total of counts cor- 
responding to the start position from the first count in 
the array of counts 1 4 within the value control table. Nat- 
urally, the value of the first start position is "0." 
[0169] Next, copy the content of the array of start po- 
sitions 1 3 to the array of category numbers 1 2. The array 
of category numbers 12 is used later as a work area at 
the time of creating the array of pointers to records. 

Step 313: Creation of the array of pointers to the value 
control table 

[0170] Next, allocate a storage area for the array of 
pointers to the value control table 20 . (The size of the 
storage area is the total count in the aforementioned ar- 
ray of counts 14.) 

[01 71 ] Next, extract one field value at a time from the 
"sex" field data from the beginning to the end, examine 
each field value to see if it matches the field value at 
each entry of the value control table, and if it matches 
the n^h field value, then set "n-1 " as a pointer to the value 
control table in the aforementioned array of pointers to 
the value control table. 

Step 314: Creation of the array of pointers to records 

[0172] Next, allocate a storage area for the array of 
pointers to records 30. In this example, the size of the 
storage area is the total of the counts in the aforemen- 
tioned array of counts 14. In the array of pointers to the 
value control table 20, from the beginning row to the 
ending row, extract one pointer to the value control table 
at a time. Extract the J^^ value of the array of pointers to 
the value control table 20 , and assuming its value is "K" 
then extract the category number corresponding to the 
K+1^h record of the value control table, and assuming 
its value is "L" then store "J-1" in the L+l^*^ clement of 
the array of pointers to records 30, and increment by 1 
the category number corresponding to the K+l^^ record 
of the value control table. 

[0173] The aforementioned operation completes the 
creation of the information block for the "sex" field (Step 
310). Information blocks for the "age" field and "occu- 
pation" field can be created in the same manner (Step 
320 and Step 330), and thus information blocks for the 
entire table-format data are obtained. 
[0174] Figs. 30 through 35 are explanatory diagrams 
for the procedure of creating the information block re- 
garding "occupation" in the table-format data shown in 
Fig. 1. 

[0175] Fig. 30 is a diagram illustrating populating with 
new data in the case in which categories are already 
defined and the types of attribute values are known in 
advance. Here, the value control table is created ac- 
cording to known category definitions. Since the start 
positions and counts are unknown, these are initialized 
to "0." In addition, storage areas are allocated for the 
array of pointers to the value control table and array of 



pointers to records and these are similarly initialized. 
[0176] Fig. 31 shows the pass in which the array of 
pointers to the value control table and the counts in the 
value control table are completed. The data to be pop- 

5 ulated is taken one item at a time starting from the be- 
ginning and its value examined as to which item (name- 
ly, which field value number) in the value control table it 
matches, and then it is stored in the array of pointers to 
the value control table and the corresponding count in 

10 the value control table is updated by "+1 " at a time. The 
example of Fig. 31 shows the state after the processing 
of the second item of data to be populated is complete. 
[0177] Fig. 32 shows the second pass for completing 
the value control table. The accumulation of counts uses 

15 the correspondence to the start position to find the start 
positions. Moreover, the value of the start position is 
copied to the category number. In the figure, the setting 
of the category number is complete. 
[0178] Figs. 33-35 show the third pass of data popu- 

20 lation. In this pass, one value (pointer) at a time is taken 
from the beginning of the array of pointers to the value 
control table, and the offset in the array of pointers to 
the value control table, namely the record number, is 
stored at the position in the array of pointers to records 

25 specified by the category number within the value con- 
trol table referenced by that value. Figs. 33, 34 and 35 
respectively show the processing of the first, second 
and last pieces of data of the array of pointers to the 
value control table of the information block regarding 

30 "occupation." 

[0179] Note that in the aforementioned explanation, 
the field of the category number is used as a work area, 
but any array that is an array of integers with a number 
of elements equal to or greater than the number of rows 

35 in the value control table, namely the total number of 
field value numbers, can also be used as the work area. 
[01 80] On the other hand, the population with new da- 
ta in the case that categories are not defined in advance 
is implemented by scanning the data to be populated 

40 and acquiring a list of values to be registered in the value 
control table and then, performing the aforementioned 
process of population with new data in the case that the 
categories are defined. 

[0181] Next, consider the case of adding an additional 
45 "Student" record to the information block regarding "oc- 
cupation" after the population with new data is complete 
as shown in Fig. 35. Fig. 36 is an explanatory diagram 
for this addition of a record. 

[0182] In this case, the field value number 0 which in- 
50 dicates "Student" is added to the end of the array of 
pointers to the value control table, and then the count of 
students in the value control table is increased by "1." 
Next, it is necessary to allocate space for storing the 
record number, namely the value of the pointer to the 
55 value control table (=1 000000), within the array of point- 
ers to records. To this end, the value at the end of the 
array of pointers to records corresponding to "Student" 
(in this example, 999999) is extracted and "1000000," 
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which is the expansion address, is stored. However, its 
sign may be reversed, for example, in order to identify 
this as an expansion address, and stored as "- 
1000000." Then, the end value of "999999" which was 
extracted previously as the expansion is stored, and fi- 
nally, the pointer value of "1000000" corresponding to 
the newly added record is stored. 
[0183] By adopting this method of adding data, the 
need to move large amounts of data at the time of adding 
data is avoided. In addition, in order to suppress the drop 
in access efficiency arising from the increase in the 
number of pointers when large numbers of records are 
added, it is sufficient to repeat the same processing as 
the third pass of the population with new data at appro- 
priate timing. 

[0184] Fig. 37 illustrates the structure of an informa- 
tion block according to another embodiment of the 
present invention. In the case that the structure shown 
in this figure is adopted, increases in the number of 
pointers are avoided and changes to data can be per- 
formed easily. In Fig. 37, the array of starting locations 
contains addresses that indicate the beginning of the ar- 
ea where the array of pointers to records is disposed. 
For example, "0" is stored as the start position for the 
field value of "Student." On the other hand, for the field 
value of "Programmer," the value of "n (where 
n>455214)" is allocated as the start position. 
[0185] Next, we shall add an explanation of Embodi- 
ment 13 of the present invention. In this embodiment, a 
sort of records is implemented using the aforemen- 
tioned information block. Fig. 38 illustrates the initial 
state of sorting records on the "occupation" field. The 
raw data shown in this figure shows the array of record 
numbers to be sorted. For example, an array of pointers 
to records obtained for fields other than "occupation," or 
a result set from a search can be used as the raw data. 
In the case of this example, for simplicity in the expla- 
nation, the record numbers of the raw data are arranged 
in the order "0, 1, 2, 9" but one must note that the 
order of record numbers prior to the sort will generally 
be random. The field values in the "occupation" field cor- 
responding to each record number are arranged in the 
order "Teacher, Programmer, Student, Other. " 
[0186] On the right side of this figure, the initial state 
of the various arrays contained in the information block 
regarding "occupation" is shown. The information block 
regarding "occupation" is created by the information 
block construction method explained with reference to 
Figs. 29-35. At the time of performing a sort of records 
on the "occupation" field, the value control table and ar- 
ray of pointers to the value control table of an information 
block regarding "occupation" prepared in advance is 
used. As the start position, the start positions set at the 
time of construction of the information block are used as 
is. The start positions are copied to the corresponding 
end positions. The area to contain the end position may 
be, for example, the area allocated for the count (the 
count array). The array of pointers to the value control 



table may be prepared in advance in record number or- 
der, for example. In the case of this example, the array 
of record numbers of the raw data are in descending 
order, so the connection between the record numbers 

5 of the raw data and the array of pointers to the value 
control table exhibits a simple relationship. In addition, 
the array of pointers to the value control table is an array 
for storing the sorted result set, so an area of the same 
size as the data to be sorted is allocated. The aforemen- 

10 tioned end positions are used as an array for storing the 
sorted results in the array of pointers to records. 
[0187] Fig. 39 is an explanatory diagram illustrating 
the first step of sorting according to Embodiment 13 of 
the present invention. In the first step, the beginning 

15 record in the raw data (in this example, the one with 
record number =0) is processed. The field value of the 
"occupation" field of the record with record number "0" 
is "Teacher." At this time, the field value number "2" 
which specifies a field value of "Teacher" is stored in the 

20 array of pointers to the value control table corresponding 
to record number "0." Then, the value "5" of the end po- 
sition corresponding to the field value number of "2" is 
extracted and this value of "5" is used as an address to 
set this record number "0" in the 5^^ position of the array 

25 of pointers to records where the sorted result set is 
stored. Next, the value of the end position correspond- 
ing to this field value number of "2" is incremented by 
"+1," so "5" is increased to "6." 

[0188] Fig. 40 is an explanatory diagram illustrating 

30 the second step of sorting according to Embodiment 1 3 
of the present invention. In the second step, the second 
record in the raw data (in this example, the one with 
record number =1) is processed. The field value of the 
"occupation" field of the record with record number "1" 

35 is "Programmer." At this time, the field value number "1 " 
which specifics a field value of "Programmer" is stored 
in the array of pointers to the value control table corre- 
sponding to record number "1." Then, the value "3" of 
the end position corresponding to the field value number 

40 of "1" is extracted and this value of "3" is used as an 
address to set this record number "1 " in the 3''^ position 
of the array of pointers to records where the sorted result 
set is stored. Next, the value of the end position corre- 
sponding to this field value number of "1 " is incremented 

45 by "+1 ," so "3" is increased to "4." 

[01 89] Thereafter, the same operation as in the afore- 
mentioned first and second steps are repeated for the 
remaining record numbers of "2, 3, 4, 5, 6, 7, 8, 9 " in 
the raw data. Fig. 41 shows the final state of the sort 

50 thus obtained. As can be seen from the sorted result set 
in this figure, the sort according to Embodiment 13 of 
the present invention results in the records being sorted 
in the order of the "occupation" field value numbers, be- 
ing reordered into the order of "2, 4, 6, 1 , 7, 0, 3, 5, 8, 9 

55 " by record number. 

[0190] In the aforementioned explanation of Embodi- 
ment 13, the case is envisioned wherein the raw data 
contains all records in the original table-format data. 
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namely the entire set. However, the sort according to 
the present invention is also effective on only a portion 
of records, namely on a partial set. Here follows an ex- 
planation of sorting a partial set in reference to Figs. 42 
and 43. 5 
[0191] Fig. 42 is a diagram illustrating the state of 
completion of the aforementioned sorting on a partial 
set. In the case of this example, the raw data given con- 
sists of a record with a record number of "0" in which the 
field value of the "occupation" field is "Teacher," and a io 
record with a record number of "1 " in which the field val- 
ue is "Programmer." When the sorting is applied to both 
of these records, sort results as shown by the sorted 
result set in this figure are obtained. At this time, the 
result set is contained in the array of pointers to records. ^5 
Therefore, an area of the same size as the entire set is 
allocated to store the sort results from a partial set. 
[0192] Thus, in the case of a sort on a partial set, it is 
preferable that the result set be compressed to the same 
size as the partial set. Fig. 43 shows the post-processing 20 
for this sorting on a partial set. This post-processing, 
namely the compression of the result set, comprises tak- 
ing the difference between the start position and end po- 
sition for each field value in the value control table, and 
extracting the count and storage position in the sort re- 25 
suits corresponding to the field value in question, and 
then arranging the sort results based on the extracted 
count and storage position. 

[0193] Next, we shall again add a general explanation 
of the aforementioned sorting of the entire set or partial 30 
set, using the flowchart in Fig. 50. Here, for both cases 
of an entire set and a partial set, the raw data is consid- 
ered to have storage position numbers attached in order 
starting from the beginning. For example, in Figs. 38 and 
41 , the storage position numbers match the record num- 35 
bers. However, in the case that the order of arrangement 
of raw data does not follow the record number, the stor- 
age location number differs from the record number. 
Note it goes without saying that the sorting described in 
detail regarding the aforementioned embodiment can 40 
be implemented by the CPU 100 executing a program 
stored in a stipulated area. 

[0194] In the sorting, first, the storage location number 

is initialized (Step 5001 ). Next, the corresponding point- 
er within the array of pointers to the value control table ^5 
is accessed for a certain storage location number (Step 
5002), and then identify the value of the end position 
where the field value number specified by the pointer is 
positioned (Step 5003). 

[0195] Thereafter, the corresponding record number 50 
is stored at the position within the array of pointers to 

records identified by the aforementioned end position 
(Step 5004). Then, in the value of the end position iden- 
tified in Step 5003 is incremented (Step 5005). The 
processing in the aforementioned Steps 5002 through 55 
5005 is performed for all of the raw data (see Steps 5006 
and 5007), and thereby, it is possible to obtain an array 
of pointers to records containing the stipulated record 
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numbers. One can see that the example of Fig. 39 cor- 
responds to Step 5002 through Step 5005 in the case 
that the storage position number is "0," and one can also 
see that Fig. 40 corresponds to Step 5002 through Step 
5005 in the case that the storage position number is "1 
[0196] In addition, in the case of sorting a partial set, 
after Steps 5001 through 5007 are executed, it is suffi- 
cient to compress the result set by means of the sort 
post-processing (see Step 5008). 
[0197] The sort according to the aforementioned Em- 
bodiment 13 of the present invention is a so-called "as- 
cending order" sort, namely a sort wherein the sort re- 
sults are arranged in order of increasing field value num- 
bers of the sorted field values. However, the sort results 
may also be arranged in "descending order, " wherein 
they are arranged in order of decreasing field value 
numbers of the sorted field values. The "descending or- 
der" sort is implemented by modifying the start position 
used in the case of an "ascending order" sort. In the case 
of this example, the starting positions for an "ascending 
order " sort are as follows: 



Student 


0 


Programmer 


3 


Teacher 


5 


Other 


6 



[0198] In contrast, the start positions for a "descend- 
ing order" sort become: 



Other 


0 




Teacher 


10-6= 


=4 


Programmer 


10-5= 


=5 


Student 


10-3= 


=7 



[01 99] A sort performed according to Embodiment 1 3 
of the present invention as such has the following ad- 
vantages. 

[0200] First, a high-speed sort is achieved. For exam- 
ple, in an operating environment using Pentium Pro® 
200MHz/Windows 95® , the novel sort according to the 
present invention achieved a sort time for 1 million of 
1 45 ms. I n contrast, in the case of a conventional Quick- 
sort, the time required to sort 1 million integers was 1530 
ms. 

[0201] Second, constant performance is obtained re- 
gardless of the type of field values stored as values. The 
present sort gives performance identical to that for inte- 
gers even when the data type of the field values is text 
or floating point, for example. In contrast, in the case of 
the conventional Quicksort or other algorithms, the 
speed is highest when the type of data handled is integer 
and lowest when the type is variable-length text. 
[0202] Third, the sorting speed does not drop even if 
the data size increases. With this sort, the sorting speed 
is expressed by 0(n) where n is the data size. On the 
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other hand, with Quicksort or other conventional high- 
speed sorting methods, the sorting speed is 0(n ■ log 
(n)), for example. 

[0203] Fourth, sorts on multiple fields can be divided 

into sorts on each field. For example, in Embodiment 13 
of the present invention described with reference to 
Figs. 38-41 , among the raw data, records corresponding 
to a field value of "Student" are arranged In the order 
record number "2," record number "4," and record 
number "6." And this order of record numbers (namely, 
record number "2," record number "4," and record 
number "6") Is preserved In the final sorted result set. 
This means that the order of records in the sorted result 
reflects the order of records prior to sorting within the 
scope of satisfying the purpose of the sort. By taking 
advantage of this characteristic of the present sort, a 
sort on multiple fields can be achieved by performing 
sequential sorts on each individual field. In contrast, with 
the conventional Quicksort, the state prior to sorting is 
known not to be reflected in the order of the sort results. 

Specific Examples 

[0204] In the aforementioned various embodiments of 
the present invention, the value control table contains a 
value listof the field values. For example, in the example 
shown in Fig. 7, the field value column contains a list 
Including the values "16," "17," "18," .... In addition, the 
value control table includes category numbers set for 
each field value number. Here follows an explanation of 
a specific example illustrating how the combination of 
such a value list and category number for a field value 
can be used to determine immediately whether or not 
multiple values for a certain field match search condi- 
tions by means of several comparative judgments. 
[0205] For example, consider the case wherein the 
following list is given as the value list, sorted by the mag- 
nitude of the value: 

0.1,0.2,..., 

100.0, 100.1, 100.2, 
1000.0 

and "value Is greater than 100 " is given as the search 
condition. 

[0206] First, from the value list, find the smallest value 
that does not satisfy the condition (in this example, 100). 
Then, set "0" as the category number for all the values 
In the value list before the smallest value, namely "1 00." 
In addition, set "1 " as the category number for all values 
in the value list after "100." Thereby, if the smallest value 
can be found, thereafter, the category number is set 
without performing any comparison operations, and 
thus a field value or field value number having a value 
satisfying the search conditions can be obtained. 
[0207] By using the bisection method or other known 



methods in the prior art, the smallest value can be found 
by a small number of comparison operations. For exam- 
ple, if there are N variations in the values present in the 
value list, then the number of comparison operations re- 
5 quired to find the aforementioned smallest value is 
roughly log2(N). 

[0208] In contrast. In the event that the value list Is not 
sorted on the magnitude of the value, If there are simi- 
larly N variations in the values present In the value list, 

10 then N comparison operations are required to find the 
value that satisfies the search condition. 
[0209] Since the value list Is sorted and category num- 
bers arc set in the value control table in this manner, the 
determination of whether or not the stipulated search 

15 conditions arc met can be speeded up. 

[0210] In addition, as would be naturally understand- 
able to a person skilled in the art, the content of the 
aforementioned value list and the search conditions are 
no more than a single specific example for explaining 

20 this example, and according to the present Invention, 
the determination of whether or not the stipulated search 
conditions are met can be speeded up for various value 
lists and for various combinations of search conditions. 
[0211] Next, we shall describe tests of searching for 

25 and tabulating 1 million records of data. The platform 
used in the tests was an ordinary personal computer 
equipped with a Pentium Pro® 200MHz processor and 
128MB of memory. Fig. 44 is a table showing the data 
used in the tests. The data consisted of one million num- 

30 bers in the range from "000000" to "999999" In the form 
of table-format data divided into three fields consisting 
of the 1 0,000's unit, the 1 0O's unit and the 1 's unit. Field 
values In the range from "00" to "00" appear 10,000 
times apiece for each field. 

35 [0212] Fig. 45 is a list of the test results showing the 
time required to search for/tabulate 1 million records, 
measured depending on the result set type. The result 
set type Is one two types, namely the aforementioned 
bit flag type and array of pointers type. The times in the 

40 test results are given in units of milliseconds (ms; 1/1 000 
seconds). 

[0213] The search performed in the aforementioned 
test is a search of an AN D of multiple fields, by connect- 
ing the three fields of "X10,000," "xlOO" and "xl" with 

45 an AND condition. The search was a cascade of the 
fields "X 10,000," "X100" and "xl" In this order. The In- 
termediate and final result sets from the search take the 
form of a bit flag or array of pointers as described above. 
The measured times are given as the average of five 

50 measurements. 

[0214] In addition, the tabulation in these tests con- 
sists of counting the number of times the various values 
(00 through 99) of the "xlOO" and "xl" fields appear in 
the result sets obtained from the search tests. The size 

55 of the table for this cross-tabulation is (100x100=) 
1 0,000 cells. The times are given in units of milliseconds 
and the average of 5 measurements Is given as the 
measured times. 
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[0215] The constitution of tine searcliing and tabulat- 
ing system for table-format data is in no way limited to 
the examples described in the aforementioned embod- 
iments, but rather the various constituent elements of 
the searching and tabulating system may be implement- 
ed in software (program), stored on a disk device or the 
like and, if necessary, the searching and tabulating sys- 
tem can be installed on a computer to perform the 
searching and tabulating of table-format data. Moreo- 
ver, the program thus implemented may be stored on a 
floppy disk or CD-ROM or other portable storage medi- 
um, and can be used in a general purpose fashion in a 
situation in which such a system is used. 
[0216] The present invention is in no way limited to 
the aforementioned embodiments but various modifica- 
tions are possible within the scope of the invention re- 
cited in the patent claims, and it goes without saying that 
these are also included within the scope of the present 
invention. 

[0217] For example, in the AND searches and OR 
searches illustrated in the aforementioned Embodiment 
2, searches were executed on two fields, but this is not 

a limitation, since it is clear that searches on three or 
more fields can also be implemented. 
[0218] In addition, in the aforementioned Embodi- 
ment 7 and Embodiment 8, a two-dimensional array was 
generated in order to perform a tabulation on two fields, 
but this is not a limitation, as it is possible to generate a 
three-dimensional or higher-dimensional array in order 
to perform a tabulation on three or more fields, and it 
goes without saying that these can be used to perform 
the aforementioned tabulation. Considering the tabula- 
tion of three fields, field value numbers q i , q, in each 
of the three information blocks are extracted, and this is 
used to identify one element P(q-|, qs) in the three- 
dimensional array. 

[0219] Moreover, regarding the aforementioned Em- 
bodiment 9 also, it goes without saying that it is possible 
to performed tabulation on three or more fields in the 
same manner as in Embodiment 7 and Embodiment 8. 
[0220] In addition, while the searching, tabulating 
and/or sorting are implemented by reading a stipulated 
program into an ordinary computer system and then ex- 
ecuting the program in the aforementioned embodi- 
ments, the present invention is in no way limited to this, 
but rather it goes without saying that it may be consti- 
tuted such that a board computer used exclusively for 
data processing is connected to a personal computer or 
other ordinary computer system, and this board compu- 
ter can execute the aforementioned processing. There- 
fore, in this s pecification, the word means does not nec- 
essarily mean a physical means, but rather it includes 
the case in which the functions of the various means are 
implemented by software and the case in which some 
or all of the functions are implemented by hardware. 
Moreover, the functions of a single means may be im- 
plemented by two or other physical means, or the func- 
tions of two or more means may be implemented by one 



physical means. According to the aforementioned de- 
scription, by means of the present invention, it is possi- 
ble to process large amounts of data expressed in table 
format without using the conventional data tables which 
5 required long access times, so the speed of tabulating 
and searching can be greatly increased. 

FIELD OF THE INVENTION 

10 [0221] The present invention is particularly suited for 
use in systems that handle large amounts of data, for 
example, databases and data warehouses. More spe- 
cifically, it is suited to large-scale scientific and technical 
calculation, control systems for plants and power supply 

^5 and the like, methods of planning of delivery and re- 
source distribution, and to order management and the 
management of clerical work such as securities trading. 



20 Claims 

1. When table-format data is represented as an array 
of records including field values with respect to a 
specific field of information, a method of extracting 

25 from said table-format data the field value corre- 
sponding to a specific field and a specific record, 

said method being characterized in compris- 
ing the steps of: 

30 keeping in a storage device a value control ta- 

ble containing field values which are located in 
order of field value numbers each correspond- 
ing to the field value belonging to the specific 
field, and a field value number- specifying infor- 
ms mation array containing information that speci- 
fies said field value numbers in the order of said 
records, 

acquiring from said field value number- speci- 
40 fying information array the field value number 

corresponding to said specific record, and 

obtaining from the field values stored in said 

value control table the field value correspond- 
45 ing to the field value number acquired as above. 

2. The method according to claim 1 , characterized in 
that, in order to categorize the field values corre- 
sponding to said field value number, category num- 

50 bers are stored in said value control table corre- 
sponding to said field value number, and 

said category numbers are accessed at the 
time of obtaining the field value corresponding to 
said field value number. 

55 

3. When table-format data is represented as an array 
of records including field values with respect to a 
field associated with a search condition, a method 
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of searching through said table-format data for field 
values that match specific search conditions, 

said method being characterized in compris- 
ing the steps of: 

5 

keeping in a storage device, for each individual 
field, individual field information such that in- 
cludes a value control table containing field val- 
ues which is located in order of field value num- 
bers each corresponding to the field value be- io 
longing to the field associated with the search 
condition, a field value number- specifying in- 
formation array containing information that 
specifies said field value numbers in the order 
of said records, and a record identification in- ^5 
formation array storing in exclusive areas for 
each of said field value numbers one or more 
pieces of record identification information relat- 
ed to identical field value numbers, and said 
value control table includes, for each of said 20 
field value numbers, record identification infor- 
mation-specifying information that indicates 
the area where said one or more pieces of 
record identification information related to iden- 
tical field value numbers in said record identifi- 25 
cation information array, 

using said record identification information- 
specifying information corresponding to field 
value numbers related to field values within the 30 
field values contained in said value control ta- 
ble that match said search conditions, to ac- 
quire record identification information from said 
record identification information array that 
matches said search conditions. 35 

4. The method according to claim 3, characterized in 
comprising the steps of: 

l^eeping in a storage device the result set of 40 
records that match said search conditions 
based on the record identification information 
which matches the search conditions, 

regarding other individual field information re- 45 
lated to other search conditions, 

acquiring from said field value number- speci- 
fying information array regarding said other in- 
dividual field information the field value num- 50 
bers corresponding to the pieces of record 
identification information that match said 
search conditions set in said result set, 

regarding said other individual field informa- 55 
tion, determining whetheror notthe field values 
associated with said extracted field value num- 
bers match said other search conditions. 



regarding said other individual field informa- 
tion, if the field values associated with said ex- 
tracted field value numbers match said other 
search conditions, extracting said pieces of 
record identification information corresponding 
to said field value numbers as pieces of record 
identification information that match said sepa- 
rate search conditions, and 

if necessary, extracting said field value num- 
bers with respect to still other individual field in- 
formation regarding still other search condi- 
tions, and repeating the determination of 
matching and extraction of records. 

5. The method according to claim 3, characterized in 

comprising the steps of: 

keeping in a storage device the result set of 
records that match said search conditions, 

regarding fields related to other search condi- 
tions, 

using field values within the field values stored 
in other value control tables that match said 
search conditions and record identification in- 
formation-specifying information correspond- 
ing to related field values to extract from a 
record identification information array the 
records that match said other search condi- 
tions, and store the records that match the 
search conditions in a specified other record 
set, 

if necessary, regarding still other search condi- 
tions, using still other record identification infor- 
mation-specifying information to extract 
records that match still other search conditions, 
and repeating the storage of still other result 
sets, and 

obtaining a final result set by eliminating dupli- 
cate records from the result sets thus obtained. 

6. The method according to any of claims 3 through 5, 
characterized in that said value control table com- 
prises, for each of said field value numbers, a start 
position that indicates the starting point of said ex- 
clusive area, and a count that indicates the number 
of records that have identical field value numbers, 
and 

a stipulated record identification information 
array is identified by accessing said start position 
and count. 

7. The method according to any of claims 3 through 6, 
characterized in that the category numbers for 
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categorizing the field values corresponding to said 
field value numbers in said value control table are 
stored corresponding to said field value number, 
and 

said category number is used to identify the 5 
field values that match the search conditions. 

8. When table-format data is represented as an array 
of records including a plurality of fields containing 
field values with respect to a field of information, a io 
method of tabulating said table-format data, 

said method being characterized in compris- 
ing the steps of: 

if n represents an integer equal to 1 or greater, ^5 
for each of n fields used in tabulation, keeping 
in a storage device individual field information 
including a value control table containing field 
values, which are located in order of field value 
numbers each corresponding to the field value 20 
belonging to the field, and a field value number- 
specifying information array containing infor- 
mation that specifies said field value numbers 
in the order of said records, 

25 

if i represents an integer in the range 1 <i<n, for 
the i^h individual information field, the total 
number of said field value numbers is repre- 
sented by N j, k| represents an integer in the 
range 0 <kj<Nj-1 , M represents an integer equal 30 
to 1 or greater, and if m is an integer in the range 
1 <m<M, then initializing elements P^{W^ , 
kj, kn) of n -dimensional M data spaces hav- 
ing a size of N ^ xN2X...xNjX...xNn, 

35 

for said n individual information fields, when j 
represents an integer in the range 0<j<(total 
number of records)-1 , extracting the respective 
field value numbers stored in the position in 
each field value number- specifying information 40 
array, and when the field value number extract- 
ed from the i^^ individual information field is rep- 
resented by qj, identifying the elements Pm(c|i, 
q2, qj, qp) of said data space, and 

45 

processing said identified values of the ele- 
ments Pm(qi,q2. — — qn)- 

9. When table-format data is represented as an array 

of records including a plurality of fields containing 50 
field values with respect to a field of information, a 
method of tabulating said table-format data, 

said method being characterized in compris- 
ing the steps of: 

55 

if n represents an integer equal to 1 or greater, 
for each of n fields used in tabulation, keeping 
in a storage device individual field information 



including a value control table containing field 
values which are located in order of field value 
numbers each corresponding to the field values 
belonging to the field, and afield value number- 
specifying information array containing infor- 
mation that specifies said field value numbers 
in the order of said records, 

if i represents an integer in the range 1 <i<n, for 
the i^h individual information field, the total 
number of either said field value numbers or 
said category numbers is represented by Nj, k,- 
represents an integer in the range 0<kj<N|-1 , M 
represents an integer equal to 1 or greater, and 
if m is an integer in the range 1 <m<M, then in- 
itializing elements Pm(*^i. ^^n) ^~ 
dimensional M data spaces having a size of 

NiXN2X...XNjX...XNn, 

for said n individual information fields, when j 
represents an integer in the range 0<j<(total 
number of records)-1 , extracting the respective 
field value numbers stored in the position in 
each field value number- specifying information 
array, and when the field value number extract- 
ed from the i individual information field or the 
category number stored corresponding to said 
field value number in the value control table of 
said i^^ individual information field is represent- 
ed by qj, identifying the elements P^iq^, 
qj, q^^) of said data space, and 

processing said identified values of the ele- 
ments Pr^(qi,q2, ...,qj, qJ- 

10. The method according to claim 9 or claim 1 0, char- 
acterized in that M=1 is true, and the step of 
processing the value of said identified element 
includes adding 1 to the current value of said ele- 
ment P ^. 

1 1 . The method according to claim 9 or claim 1 0, char- 
acterized in that the step of processing the value 
of said identified element P^^ includes: 

for at least one element P ^ among the M ele- 
ments P 

for separate individual field information kept in 
a storage device, acquiring the field value num- 
bers stored in the f ^ position in the field value 
number- specifying information array, 

from among the field values stored in the value 
control table of said separate individual field in- 
formation, acquiring the field value correspond- 
ing to said field value number thus acquired, 
and 
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updating the current value of said element 
and the value of said element P^^ in combina- 
tion with said field value thus obtained. 

12. The method according to any of claims 1 through 5 
11, characterized in that said information that 
specifies the field value number is the field value 
number itself. 

13. The method according to any of claims 1 through io 
11, characterized in that said information that 
specifies the field value number is a binary value 
wherein 1 bit is allocated to each field value number, 
thus setting whether or not it is set. 

15 

14. When table-format data is represented as an array 
of records including a plurality of fields containing 
field values with respect to a field of information, an 
apparatus for searching for and tabulating said ta- 
ble-format data, 20 

said apparatus being characterized in com- 
prising: 

astorage device for keeping, for each individual 
field, a value control table containing field val- 25 
ues, which is located in order of field value num- 
bers each corresponding to the field value be- 
longing to the specific field, and a field value 
number-specifying information array contain- 
ing information that specifies said field value 30 
numbers in the order of said records, 

field value number acquisition means for ac- 
quiring from said field value number-specifying 
information array kept on said storage device 35 
the field value number corresponding to said 
specific record, and 



ray, and 

furthermore, has search means for using said 
record identification information-specifying infor- 
mation corresponding to field value numbers relat- 
ed to field values within the field values contained 
in said value control table that match stipulated 
search conditions, to acquire record identification 
information from said record identification informa- 
tion array that matches said search conditions. 

16. When table-format data is represented as an array 
of records including a plurality of fields containing 
field values with respect to a field of information, a 
computer-readable storage medium upon which is 
recorded a program for searching for and tabulating 
said table-format data, 

said storage medium being recorded with a 
program characterized in comprising: 

a step of keeping in a storage device, for each 
individual field, a value control table containing 
field values which is located in order of the field 
numbers each corresponding to the field value 
belonging to the field, and a field value number- 
specifying information array containing infor- 
mation that specifies said field value numbers 
in the order of said records, 

a step of acquiring from said field value 
number- specifying information array kept on 
said storage device the field value number cor- 
responding to said specific record, and 

a step of obtaining from the field values stored 
in said value control table kept on said storage 
device the field value corresponding to the field 
value number acquired as above. 



field value obtaining means for obtaining from 
the field values stored in said value control ta- 40 
ble kept on said storage device the field value 
corresponding to the field value number ac- 
quired as above. 

15. The apparatus according to claim 11, character- ^5 
ized in that said storage device keeps individual 

field information that has said value control table, 
said field value number-specifying information ar- 
ray, one or more pieces of record identification in- 
formation related to identical field value numbers, 50 
and a record identification information array stored 
in exclusive areas for each of said field value num- 
bers, and said value control table includes, for each 
of said field value numbers, record identification in- 
formation-specifying information that indicates the 55 
area where said one or more pieces of record iden- 
tification information related to identical field value 
numbers in said record identification information ar- 



17. When table-format data is represented as an array 
of records including a plurality of fields containing 
field values with respect to a field of information, a 
computer-readable storage medium upon which is 
recorded a program for searching through said ta- 
ble-format data for records that contain a field value 
that matches search conditions, 

said program being characterized in com- 
prising: 

a step of keeping in a storage device, for each 
individual field, individual field information such 
that includes a value control table containing 
field values which is located in order of field val- 
ue numbers each corresponding to the field val- 
ue belonging to the field associated with the 
search condition, a field value number- speci- 
fying information array containing information 
that specifies said field value numbers in the 
order of said records, and a record identifica- 
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tion information array storing in exclusive areas 
for each of said field value numbers one or 
more pieces of record identification information 
related to identical field value numbers, and 
said value control table includes, for each of 5 
said field value numbers, record identification 
information-specifying information that indi- 
cates the area where said one or more pieces 
of record identification information related to 
identical field value numbers in said record io 
identification information array, and 

a step of using said record identification infor- 
mation-specifying information corresponding to 
field value numbers related to field values with- ^5 
in the field values contained in said value con- 
trol table that match said search conditions, to 
acquire record identification information from 
said record identification information array that 
matches said search conditions. 20 

18. A computer program product that is loadable into 

the memory of a computer, and that implements a 
method according to one of claims 1 through 13. 

25 

19. A sorting method whereby an array of record iden- 
tification information specifying records including a 
plurality of fields containing field values with respect 
to a field of information is rearranged on a specific 
field, 30 

said method being characterized in compris- 
ing the steps of: 



information. 

20. The method according to Claim 1 9, further compris- 
ing the step of: 

keeping in a storage device, for each individual 
field, individual field information such that in- 
cludes a value control table containing field val- 
ues which is located in order of field value num- 
bers each corresponding to the field value be- 
longing to the field associated with the search 
condition, a field value number- specifying in- 
formation array containing information that 
specifies said field value numbers in the order 
of said records, and a record identification in- 
formation array storing in exclusive areas for 
each of said field value numbers one or more 
pieces of record identification information relat- 
ed to identical field value numbers, and said 
value control table includes, for each of said 
field value numbers, record identification infor- 
mation-specifying information that indicates 
the area where said one or more pieces of 
record identification information related to iden- 
tical field value numbers in said record identifi- 
cation information array, and 

said storing step being arranged such that the 
pieces of record identification information is 
stored in the storage location according to the 
record identification information-specifying in- 
formation. 



for each of said records, associating said 
record identification information with field value 35 
numbers corresponding to the field values of 
said fields, 

for each of said field value numbers, defining 
the storage location after reordering said 40 
record identification information, 

sequentially extracting said record identifica- 
tion information from said array, 

45 

determining said field value number corre- 
sponding to said record identification informa- 
tion thus extracted, 

storing said record identification information 50 
thus extracted in said storage location accord- 
ing to the record identification information- 
specifying information corresponding to the 
field value number thus determined, and 

55 

updating said storage location where said 
record identification information is to be stored, 
in order to store the next record identification 



21. The method according to claim 20, characterized 

in that said record identification information-speci- 
fying information comprises a start position that in- 
dicates the starting point of the area of said storage 
location, and an end position that is initially equiva- 
lent to the start position and whose value is incre- 
mented upon said update. 

22. A sorting apparatus that rearranges an array of 
record identification information specifying records 
including a plurality of fields containing field values 
with respect to a field of information, 

said apparatus being characterized in com- 
prising: 

for each of said records, means of associating 
said record identification information with field 
value numbers corresponding to the field val- 
ues of said fields, 

for each of said field value numbers, definition 
means for defining the storage location after re- 
ordering said record identification information, 

field value number determination means for se- 
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quentially extracting said record identification 
information from said array and determining 
said field value number corresponding to the 
record Identification Information thus extracted, 

5 

record identification information storage means 
for storing said record Identification Information 
thus extracted In said storage location accord- 
ing to the record Identification information- 
specifying information corresponding to the 10 
field value number thus determined, and 

updating means for updating said storage loca- 
tion where said record Identification Information 
is to be stored, in order to store the next record ^5 
Identification Information. 

23. The apparatus according to Claim 22, further com- 
prising: 25. 

20 

a storage device for keeping, for each Individual 
field, a value control table containing field val- 
ues for that field corresponding to a field value 
number that uniquely identifies said field value, 
which is a field value number that Is common 25 
to the various fields and has a stipulated order 
from an initial value, and a field value number- 
specifying Information array containing infor- 
mation that specifies said field value numbers 
In the order of said records, 30 



a step of sequentially extracting said record 
Identification information from said array, 

a step of determining said field value number 
corresponding to said record identification in- 
formation thus extracted, 

a step of storing said record identification infor- 
mation thus extracted in said storage location 

according to the record identification informa- 
tion-specifying information corresponding to 
the field value number thus determined, and 

a step of updating said storage location where 
said record identification information Is to be 
stored, In order to store the next record identi- 
fication information. 

A computer program product that is loadable Into 
the memory of a computer, and that implements a 
method according to one of claims 1 9 or 21 . 



the value control table further includes record 
Identification information-specifying informa- 
tion that, corresponding to said field value 
number, indicates the area in said record iden- 35 
tification information-specifying information ar- 
ray where said one or more pieces of record 
Identification information regarding identical 
field value numbers are stored, 

40 

24. A computer-readable storage medium upon which 
Is recorded a sorting program for rearranging on a 
specific field an array of record Identification Infor- 
mation specifying records including a plurality of 
fields containing field values with respect to a field ^5 
of Information, 

said sorting program being characterized in 
comprising: 

for each of said records, a step of associating 50 
said record identification information with field 
value numbers corresponding to the field val- 
ues of said fields, 

for each of said field value numbers, a step of 55 
defining the storage location after reordering 
said record identification information, 



25 



EP1 136 918 A1 



FIG. 1 

(PRIOR ARD 



RECORD NUMBER 


SEX 


AGE 


PROFESSION 


0 


FEMALE 


18 


PROGRAMMER 


1 


MALE 


21 


STUDENT 


2 


FEMALE 


31 


TEACHER 










999999 


FEMALE 


16 


STUDENT 



FIG. 2A 

(PRIOR ART) 



FIG. 2B 

(PRIOR ART) 



CO :3o 



5> 2S 
m 

CO t— I 

CO :s 
m CO 
CO ::o 
m 
5> 
CO 



<7> 



999999 



FEMALE 
18 

PROGRAMMER 



MALE 
21 

STUDENT 



FEMALE 
31 

TEACHER 



FEMALE 
16 

STUDENT 



o 

C73 



3> 
O 
<=> 

m 

CO 
CO 

m 

CO 



I — I 

50 

m 
O 
— 1 
•—I 
O 

o 



o 



3> 



0 

1 

2 

999999 
0 
1 
2 

999999 
0 
1 
2 

999999 



FEMALE 
HALE 
FEMALE 

FEMALE 



18 
21 
31 

16 

PROGRAMMER 
STUDENT 
TEACHER 

STUDENT 



FIG. 28 






20 



160 



HARD 

DISK 



vr 



30 



i 



DISPLAY 



140 



KB/ 
MOUSE 



150 



26 



EP1 136 918 A1 



FI6. 3 



ARRAY OF POINTERS TO VALUE 
CONTROL TABLE 



POINTERS 



0 
1 




0 
1 



VALUE CONTROL TABLE 

FIELD CATEGORY 
VALUE NO. 



11 



12 10 



FIG. 4 



0 

t 



ARRAY OF POINTERS TO VALUE 
CONTROL TABLE 



POINTERS 




0 
1 



VALUE CONTROL TABLE 
HELD CATEGORY 



VALUE 



n 



NO. 



12 



COUNT 



M 10 



27 



EP1 136 918 A1 




EP1 136 918 A1 



CO 



CO 

o 
z 

5 
or 
< 
O 

or 

o 
o 
-J 

CO 

o 

g 

cr 
o 



CO 
Q 



C/) 

cr 

LLI 



00 

cr 

UJ 

z 
o 

O o S 

J^o o 
>: UJ Q- 

c o 
< I- 



o 



CO I ^ 

9^ I to _j 
o o - < 

O ^ X 2 
UJ X UJ ^ 



— UJ * 
CO X UJ 

O ^ X CO 2 
CO UJ X UJ 



q: S: □ cc 5 w 































o 


csi 




6666 1 



o ^ 



CO ^ 
CD CD 

lo in 

CM CN4 
CO CO 
CO CO 



cn 
o> 

CJ> 



LU 
-J 
CD 
< 
(- 

-J 
O 

cr 
h- 
z 
o 
o 

Lii 

< 
> 



o 
o 



2 

1^ CO 
CO o 
Ol 

CC 

o . 
o o 

UJ z 

< 
o 



UJ 

< 
> 





CO 


CO 


CO 


lO 


Tj- 


CN4 


r-* 


CO 


<£> 


CD 


CO 


O 














CM 




CO 




C£7 








lU 


LU 






< 


< 






UJ 




U. 



11 

UJ ^" 

^- 

Z O 

Q-z 
o o 

< 3 

cr -J 
cr < 

< > 




CO 

cr 

UJ 



O . . . 

Q- o ^ eg 



cn 



29 



EP1 136 918 A1 



UJ 

o 
< 

o 
z: 

o 
q: 
< 
a 

uu 

o 
o 

-J 
CD 

O 

cr 
O 



CO 

cr 

UJ 

2: 

o 
a 

o 

< 

q: 

< 



00 
Q 

cr 
o 
o 

UJ 

on 



CO 

q: 

LU 

2: 
o 



CO X 

U. Q ^- 

H O ^ UJ 

□ a 5 <5 

















CD 






0 












C7> 






CQ 
-J 

o 
q: 
h- 
2 
o 
o 

LU 

-J 



o 
o 



00 O 
C3L 

>- 

o . 
o o 



LU 



r* 00 

oo 00 
10 10 



o> 

<J> 

cn 
cn 

<7> 



CO 


0 


CD 


CO 


CO 


10 




CO 






CO 


to 


CO 


10 




CO 










00 


10 


CO 


<o 


CO 


10 




CO 




to 


to 


00 


CO 


O) 


10 


10 




to 












CO 


CM 


to 




CO 






0 


00 


CO 




0 


CO 




CO 




CD 




cn 




CO 


Oi 


CO 








00 




CO 












to 




Tl- 




to 




00 




CO 








a> 






CM 


CO 


CM 


to 








o> 








CM 




10 




cn 




o> 


























00 




0 




















CM 


CM 




CO 




5 



o 

CO 

q: 

LU 

z 

o 
a 

LU 

O 

< 

on 
< 



CD 
< 



O 

cr 

o 
o 

UJ 

5 



O ^ CM CO ^ to 



to 



CO 
CO 




CO 

cr 

UJ 

2: 

o . . 



CM 



to 



a> 

CJ> 

cn 

<7> 

a> 



30 



EP1 136 918 A1 



< 
a 
z> 
o 
o 
o 

a 
z 

o 
a 
< 



o 
o 

CD 

z 
o 

cr 
o 

u. 



cr 
tu 
I- 
z 

o 
a 



00 X H 9 

Q I- < 

o o ^ 

CO QJ X O ,^ 



O 



CO 



CO 

cr 

UJ 

I- 

oo 5 



< 
cr 
cr 
< 



CO 
Q 

cr 
o 
o 

LU 

cr 
o 



o 
a 



3 
O 

o 



LU 
-J 
CO 

< 



o 



o 
o 

LU 
ZD 
-I 
< 
> 



11 

CO 

CO o 

Q- 

> 

cr 
o 

o o 

g= 



UJ 




cn 





a> 


o 


r- 




r- 




CO 




oo 


CO 


CM 


in 


CO 




CM 


lO 


o> 




CM 




CO 






o 




CO 


CO 








<o 






o 






in 


CM 






lO 


lO 








CO 












h- 


ER 


cr 




Z 




LU 


cr 


UJ 




X 


LU 


o 


-< 

a: 


o 


X 




O 


< 






O 


UJ 


O 


CO 


PR 







O LU 
H -J 

uu ^ 
cr 



o o 

< 3 

cr -J 
cr < 

< > 



O CM CO 



CO 

cr 

UJ 

O . 




CM 



CM 



a> 
a> 
o> 
c» 



31 



EP1 136 918 A1 



FIG. 9 



Q START ^ 



SELECT SPECIFIC 
INFORMATION BROCK 



SET I IN IHE CATEGORy NliMBER OF ROWS 
THAT MATCH THE SEARCH CONDITIONS 



I 



EXTRACT STARTING POSITIONS 
AND COUNTS 




r 


EXTRACT POINTERS FROM THE 
ARRAY OF POINTERS TO RECORDS 




r 


CREATE RESULT SET 




r 



c 



END 



■S100 



•S102 



-8104 



■S106 



■8108 



32 



EP1 136 918 A1 



LU 
O 
< 

a 

5 
q: 
< 
o 

LU 

cr 

o 
o 
—I 



< 
O 



CO 

cr 

LU 



O 
a 



00 

cr 

LiJ 



cr 
o 



o p ^ 

< 
cr 
cr 
< 











CD 






a> 






cn 






O) 












CO 
CD o> 
CO oo 



a> 

CD 



LU 
-J 
CD 
< 



o 
cr 
h- 
z 
o 
o 

LU 
-J 

5 



2 

O 
O 

I— o 

CO 
CO O 

>- 
cr 

o . 

CD O 

o 



33 



O LU 
h- -J 

2? 

LU 

h- -J 
Z O 

o o 

cr ZS 
cr < 
< > 



O CM CO ^ to 



OO 


o 


o> 


CO 


CO 


in 




CO 






CD 


in 


oo 


to 




CO 










CO 


to 


CO 


CO 


CD 


m 




CO 




to 


lO 


CO 


CO 


o> 


in 


m 




in 












oo 


CM 


lO 




CO 


























o 


oo 


oo 




o 


CO 




<£> 




CD 




o> 




CO 


o> 


CO 








CO 




oo 












in 




^ 




in 




CO 




CO 








CD 






CM 


CO 


CM 


to 








CD 








CM 




in 




CD 




CD 




o 


o 




O 


o 




o 




O 


CO 




00 


CD 


o 






















CM 




CO 




GO 



in . in 



00 

cr 

LU 




O 



to 



CD 
CD 
CD 
CD 
O) 

cn 



33 



EP1 136 918 A1 







CD 




CD 




cr> 














UJ 

a 
< 

o 
z 

o 
a: 
< 
O 

LLI 

o 
o 

Z 

o 

q: 
o 



LU 
-J 

< 



O 

cr 

Z 

O 

o 



o 
o 



H- O 

CO 
CO O 

a. 

>- 
a: 

o . 
c? o 



< 
o 



tu 



UJ 



CO 


o 


a> 


CO 


CO 


in 




oo 






CD 


in 


00 


to 




CO 




^ 






CO 


in 


<£> 


CO 


C£> 


in 




CO 




tn 




CO 


CO 


<Ti 


lf> 


in 




lO 












OO 


CM 


in 




CO 


























o 


CO 


00 




O 


CO 




CO 








<T> 




n 


cn 


CO 








00 




oo 












iO 








in 




oo 




CO 














CM 


CO 


Cn4 


in 








a> 








CM 




in 




o> 




CJ> 




o 


O 




O 


O 




o 




O 


CO 




00 


CJ> 


o 




















CM 


CM 








00 



cr 5 

LU 



O <M €0 IO 



in 

<o 



O 
Q. 

U. 

O 

>- 
< 

cr 
< 



o 
cr 



o 
o 

LU 

< 
> 




34 



EP1 136 918 A1 



FIG, 12 



c 



START 



:> 



OBTAIN RESULT SET FOR THE 
FIRST SEARCH CONDITION 



■S120 



SELECT SECOND SPECIFIC 
INFORMATION BLOCK 



I 



SET THE CATEGORY NUMBER THAT MATCHES 
THE SECOND SEARCH CONDITION REGARDING THE 
SECOND SPECIFIC INFORMATION BLOCK 



■SI24 



I 



EXCTRACT POINTERS TO RECORDS 
FROM THE RESULT SET 



•8126 




35 



EP1 136 918 A1 



C3 



00 
QC 
LU 
I- 

z 
o 

Q. 



Q ^ E 

T °- 

O o ^ 
w ui ? O ,^ 



o 



o S 

o o 

LU a 
a. 























6666 1 






CD 
CD 
CD 
<T> 
CD 

a> 



d 



CD 

o> 

CD 
CD 

o> 
Oi 



36 



EP1 136 918 A1 




00 



oo S 
o o 



q: Q. r- 
O =5 ^ 

W LU X O ' 



ID 

o 
o 
o 

2 

S 
or 
< 
o 

UJ 
O 

o 

-J 
ffi 

2: 
o 

p 

i 

o 



2 O 



o 
o 



5i 

P CO 
CO O 

a. 

>- 

o . 
o o 



< 



< 
> 



UJ -IS 



CNJ CM 

tr> in 
u-> ir> 





CD 


o 






r*. 


r«» 


ro 




oo 


(O 


CM 


IT) 


<o 


lO 


CM 


ID 


o> 


CM 


CM 


*cr 


CO 






o 




r> 


ro 






CJ> 


to 






O 


r«- 




lO 


CM 


r- 




»o 


ir> 








oo 


<jy 




o 


O 


O 


I- 




q: 




z 


i 


UJ 


q: 


UJ 




X 


UJ 


Q 




o 


X 




CD 


< 


H- 


CO 




UJ 

1- 


O 










o 




CM 


CO 




37 



EP1 136 918 A1 



LU 

00 



3 
cr 



O Q 

u- q: 

a ^ 
< ^ 

t ^ 



CT) 

on 
o> 



Ct 
UJ 

S cr 
Iff 

ll q: 5 
o o o 

w O CL 

cr o 
< h- 




cn 
cn 



4 

LU 

a 
< 



Q 
< 

a 

UJ 

cr, 

o 
o 

CQ 



O 
< 

:e 
on 
o 
u. 



o 
o 



I- o 

H- CO 
CO O 

DL 

o . 
o o 



o 



LU 

-J 

< 

-I 
O 

q: 

z 
o 
a 

UJ 
3 



mi. 



CO 


o 


a> 


CO 


CO 


m 




oo 






CT> 


m 


00 


in 




CO 












lO 


CO 


CO 


CO 


in 




CO 




in 


in 


CO 


CO 


ay 


iO 


in 




in 












oo 


CsJ 


in 




CO 


























o 


co 


oo 






CO 








O) 




o> 




cr> 




CO 








00 




CO 












in 








tn 




oo 




CO 








O) 




^ 


CNI 


CO 


CM 


to 
















<M 




to 




a> 




o> 




o 


O 




O 


o 




o 




o 




r- 


OO 




o 




















CM 













S Ouj 

1 1 1 



^ OJ CO 



to 



CO 



o 

CL 



Z O 

o o 
> 

< _ 
cr ^ 
cr < 

< > 



CO 

cr 

LU 

Z 

O 
Q. 




O — CM 



LU 



CD 

o> 

CJ> 

cn 
o> 



38 



EP1 136 918 A1 



< 
Q. 

O 
O 
p 

a 
z 
5 

< 
a 

LU 

cc 

a 
o 



CQ 

\- 
LJJ 
CO 



3 
00 
LU 

q: 



a: ^ 
o Q 
11- q: 

a 9 
H o 



CO 

tr 



O 

CL 
U. 

o 

CC 

< 



CO 

cr 

LU 

z 
o 



sis 
«^ 3= ^ 

O o X = ^ 

»— o 5 o » 
«^ tiJ ^ o * 





IT) to 

lo in 



O 

y o 

CD Z 
< h- o 

_||- CO 

0C/5O 

cr . ^ 



OS 







o 


r- 




r«- 


r»- 


co 


CM 


co 


{£> 


CM 


ir> 


CO 


U> 


CM 


in 


o> 


CM 


CM 




CO 






o 




r> 


CO 






o> 


<o 






O 


1^ 






CM 


r- 




to 


m 


r- 






GO 


o> 




o 


o 


o 




2 O 

o o 

< 3 
CC d 
cc < 

< > 



cr 












UJ 












1- 
z 




o 


CM 




O 


o 













o 



39 



EP1 136 918 A1 



G. 17 C 



Start 



SELECI m mFORMAIION BLOCK REGARDING _^140 
' SEr AS THE FIRST IKFORUAilON BLOCK 



OEIECI FIELD VALUE NUMBER OF ' 0' CORRESPONDING 

10 FIELD VALUE OF ' MALE' WIIHIN M VALUE 
CONTROL TABLE OF THE SPECIFIC INFORHAIIONBLOCK 



COUNT CORRESPONDING TO FIELD VALUE NUMBER 
OF ' 0' IS 632564, SO lOlAL NUMBER OF 
HALES IS OEIERtllNEO TO BE (32564 



mmm mm locations of pointers 

TO records IHHEREIN THE lU IS MALE. AND 
^EEP array of record numbers as RESULT SET 



SELECT INFORMATION SLOCK REGARDING ' AGE' 
AS SECOND SPECIFIC INFORIiATlOH BLOCK 
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FROM ARRAY OF POINTERS TO VALUE CONTROL TABLE OF SECOND 
SPECIFIC INFORMATION BLOCK, EXTRACI FIELD VALUE NUMBER 
CORRESPONDING TO RECORDS SPECIFIED IN RESULT SET 
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t: 



EXTRACT AGE RELATED TO THE EXTRACTED 
FIELD VALUE NUMBER 
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FI6. 19 



^ START ^ 



SELECT INFORUATION BLOCK RE6A8DIN6 
' OCCUPAIION ' AS [HE flKSf INFOIIUAriON BLOCK 



I 



USE IHE SEARCH CONOlllON OF ' OCCUPATION IS STUDENT ' 
TO CREATE FROM AMONG ALL DECORDS A RESULT SET CONTAINING 
THE RECORDS IN SHICK OCCUPATION IS STUDENT 
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SELECT THE INFORMATION BLOCK REGARDING ' SEX' AS THE 
SECOND INFORMATION BLOCK, AND SELECT IHE INFORMATION 
BLOCK REGARDING ' AGE' AS IHE THIRD INFORMATION SLOCK 
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SEQUENTIALLr EXTRACT POINTERS TO RECORDS 
FROM BEGINNING OF RESULT SET 
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USING THE EXTRACTED POINTERS TO RECORDS, 
GET IHE SEX FROM THE SECOND INFORMATION BLOCK 
AND THE AGE FROM THE THIRD INFORMATION BROCK 



178 



TOTALS FOR BOTH MALE AND FEMALE SEXES ARE 
INCREMENTED BY 1 FOR EACH EXTRACTION TO CALCULATE 
THE TOTAL EXTRACTED AGE FOR MALES AND FEMALES 
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DIVIDE TOTAL AGE FOR BOTH HALES AND 
FEMALES BY THE TOTAL TO CALCULATE THE 
AVERAGE AGE OF MALE AND FEMALE STUDENTS 
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END 
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FIG. 20 



c 



START 
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FOR THE TWO FIELDS OF SEX AND 
OCCUPATION. LOAD THE FIRST AND SECOND 
INFORMATION BLOCKS INTO MEMORY 



■190 



INIIIALIZE A 2 X4(2 fiO» BY < COLUMN) TWO-DIMENSIONAL 
ARRAY TO CONTAIN TABULATION DATA, CORRESPONDING TO 
TOTALS OF I AND 4 FIELD VALUE NUMBERS FOR SEX AND OCCUPATION 


_^192 
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SEQUENTIALLY EHRACT FIELD VALUE NUMBERS ql AND q2 FROM THE 
BEGINNING OF THE ARRAY OF POINTERS TO THE VALUE CONTROL TABLE 
OF THE FIRST AND SECOND INFOfiilATION BLOCKS. AND IDENTIFY AN 
ELEMENT P(ql,q2) OF THE TY/O-DIMENSIONAL ARRAY 


_^194 
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INCREMENT THE VALUE OF THE IDENTIFIED 
ELEMENT P{qt,q2} 
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FIG. 22 



LOAD INIO MEMOfllf A FIRST AND SECOND 
INFORMATION BLOCK FOR THE THREE FIELDS 
OF SEX. OCCUPATION AND AGE 



iNniALi2E m I xiii m sr i column) two-dimensional mkn m 

STORING TABULATION DATA CORRESPONDING TO 2 AND 4 AS THE TOTAL 

mmm of field valoe mmm for sek and occdPAiion 



FROM DEGINNING OF ARRAY OF POINTERS TO VALUE COOTROL TABLE OF THE 
FIRST AND SECOND INFORMATION BLOCKS, SEOVENTIALir EXTRACT FIELD 
VALUE NUIIBEfiS ql,q2 AND q3 TO lOENIIFy IHE COORDINATES (ql.Q2) OF 
AN ELEMENT OF IHE IINO-DIMENSIONAL ARRAY 
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INCREMENI BY I IHE VALUE OF THE ELEMENT 
PKql.q?) OF THE FIRST 
TWO-DIMENSIONAL ARRAY 
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ACQUIRE THE FIELD VALUE (AGE) CORRESPONDING 10 
THE FIELD VALUE NUMBER q3 WIIH RESPECT TO THE 
INFORMATION RESPONSIBLE REGARDING ' AGE' 



ADD THE ACQUIRED AGE TO THE VALUE OF 
ELEMENT P2(q1,q2) OF THE SECOND 
TWO-DIMENSIONAL AKRAY 
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CALCULATE •P2(q1,q2)/P1(ql,q2) AMONG THE VARIOUS 
ELEMENTS OF THE ItfO-DIMEMSIONAL ARRAY 
Pt AND THE TWO-DIMENSIONAL ARRAY P2 
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END 



45 



EP1 136 918 A1 



FIG. 23A 
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START ^ 



FIG. 27 



EXIfiACI RESULT SEI FOR THE FIRST SEARCH CONDITION 
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-220 



SELECT SECONO SPECIFIC INFORMATION BLOCK 



■222 



INniALI2E CATEGORY KUUBERS IN THE VALUE CONTROL 
TAeLE FOR THE SECOND SPECIFIC INFORMATION mi 



-224 



EXTRACT POINTERS TO RECORDS FROM THE 
RESULT SET 



FOe THE SECOND SPECIFIC INFOEMAIION BLOCK. 
E^R^CT FIELD VALUE NUMBES CORRESPONDING 
10 IHE POINTERS 10 RECORDS 



■226 



■228 




SET CATEGORY NUM8ER TO 




YES 



234 



SEI CATEGORY NUMBER 10 0 



240 



ADD RECORD NUMBER TO FINAL RESULT SET 
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END 
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FIG. 29 
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DATA PREPARATION 
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GENERATE THE INFORMATION BLOCK FOR THE 
' SEX' FIELD 


^3 




r 






CREATE THE VALUE CONTROL TABLE 


^3 








CREATE ARRAY Of POINTERS TO VALUE CONTROL TABLE 
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CREATE ARRAY OF POINTERS TO RECORDS 



•314 



CONSTRUCTION OF ' SEX' FIELD 
INFORMATION BLOCK 



CONSTRUCTION OF ' AGE' FIELD 
INFORMATION BLOCK 



-320 



CONSTRyCTION OF ' OCCUPATION' FIELD 
INFORMATION BLOCK 
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END 
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